Job Summary
Key Responsibilities
2. Collaborate with the engineering and development teams to implement efficient and scalable solutions that enhance system performance.
3. Develop and maintain support documentation, standard operating procedures, and best practices for the support team.
4. Identify opportunities for automation and implement tools to streamline support processes.
5. Monitor system performance and provide recommendations for improvements to optimize system reliability.
6. Participate in on-call rotations to address critical incidents and ensure 24/7 system availability.
7. Conduct regular performance evaluations, provide feedback, and mentor team members to promote professional growth.
Skill Requirements
Detailed JD is as below:
Should have 7 or more years of IT experience.
Should be well versed in Site Reliability Engineering (SRE) and ITIL concepts.
Automation & DevOps Tools
Ansible (Playbooks)
Jenkins
XLR (or similar orchestration tools)
AI/Automation tools (preferred)
Version Control
GitHub / Bitbucket
Monitoring & Observability
Splunk, Dynatrace, OpenTelemetry (OTel)
Programming & Scripting
Python
Shell Scripting
Experience with event-driven systems and streaming platforms is good to have.
Understanding of ITIL, Incident Management, and Change Management processes.
Should have good analytical skills.
Team player, ready to work in 16x7 rotational shifts.