Job Description:
We are seeking a DevOps Support Engineer (Night Shift) to help ensure the smooth operation of our systems overnight. In this role, you will monitor infrastructure, assist with basic troubleshooting, apply updates and patches, and provide support for customer issues. If more complex issues arise, you’ll escalate them to senior engineers as needed. This is a key support role, ideal for individuals who are comfortable working independently during off-hours.
Key Responsibilities:
Infrastructure Monitoring & Maintenance:
Continuously monitor key infrastructure and services during the night shift to ensure operational excellence. Identify and resolve any issues that may affect system performance, availability, or security.
Troubleshooting & Incident Resolution:
Diagnose and resolve infrastructure-related incidents, ensuring minimal disruption to system operations. Take appropriate actions to rectify any issues, document resolution steps, and follow up on long-term solutions if required.
Patch Management & System Updates:
Assist with the deployment of critical patches, updates, and configuration changes to ensure systems are secure, stable, and up-to-date, in line with internal change management protocols.
Customer Support Coordination:
Address customer inquiries and issues during the night shift, coordinating closely with the support team. Ensure timely and professional communication with customers and provide solutions or escalate as needed to maintain high service standards.
Escalation & Collaboration:
For complex incidents or potential system-impacting events, work collaboratively with senior DevOps engineers or the team lead to analyze and resolve the situation effectively. Ensure seamless escalation and clear communication to minimize downtime and customer impact.
Handover & Communication:
Provide clear and concise updates to the day shift team on any unresolved incidents or ongoing tasks. Maintain a smooth transition by documenting critical information for the team to continue managing effectively.
Key Skills Required:
General DevOps Knowledge:
Familiarity with foundational DevOps tools and practices such as version control, CI/CD pipelines, and infrastructure management.
Containerization & Orchestration:
Understanding of containerization technologies (e.g., Docker) and container orchestration tools (e.g., Kubernetes) for effective troubleshooting and system management.
System Monitoring & Performance Tools:
Proficiency with monitoring and alerting tools like Prometheus, Grafana, or other infrastructure management platforms to identify system health and performance metrics.
Scripting & Automation:
Basic proficiency in scripting (e.g., Bash, Python) to automate tasks, manage configurations, and resolve issues efficiently.
Cloud Technologies:
General understanding of cloud platforms (e.g., AWS, Azure, GCP) and cloud-based infrastructure, including provisioning and managing resources.
AI & Automation Tools:
Familiarity with AI-based monitoring or automation tools that could assist in identifying issues or optimizing workflows would be a plus, but not a primary requirement.
Communication & Collaboration:
Strong communication skills to effectively work with customers, internal teams, and senior engineers. Ability to relay technical information in a clear and concise manner.
Problem-Solving & Escalation:
A methodical approach to diagnosing issues and determining when to escalate to senior team members. Prioritize tasks effectively based on business needs.
Willingness to Learn:
This role is ideal for someone who is eager to gain experience in DevOps and system support.
Education:
A Bachelor’s degree in Computer Science, Information Technology, or a related field, or equivalent work experience.
Experience:
0-1yr,Experience in Support Roles if previously, DevOps related role is required.
Should be comfortable in night shift.