Job Summary
Platform Management: Oversee APM (Application Performance Monitoring) and Infrastructure monitoring stacks. Incident & RCA Management: Define alerting thresholds, reduce alert fatigue, and lead Problem Management/Root Cause Analysis. Team Leadership: Mentor engineers and manage global shift rotations for continuous monitoring. Service Level Agreements (SLA): Ensure systems meet uptime targets and compliance standards Operations Management with focus on continuous improvement ,problem-solving, meeting client SLAs, and empowering teams through effective people management.
Key Responsibilities
1. Enhance operational systems to facilitate improved management reporting, streamline information flow, optimize business processes, and support organizational planning.
2. Understand client requirements and accountable in ensuring support team is meeting client expectations.
3. To lead and mentor the project team and ensure transparent communication of project goals.
4. Brining new ideas and innovation for process development and overall organizational progress.
5. To provide solutions commensurate with the customersâ needs within the ambit of the given environment so as to lead to business results.
Platform Management: Oversee APM (Application Performance Monitoring) and Infrastructure monitoring stacks. Incident & RCA Management: Define alerting thresholds, reduce alert fatigue, and lead Problem Management/Root Cause Analysis. Team Leadership: Mentor engineers and manage global shift rotations for continuous monitoring. Service Level Agreements (SLA): Ensure systems meet uptime targets and compliance standards
Platform Management: Oversee APM (Application Performance Monitoring) and Infrastructure monitoring stacks.\\\\r\\\\nIncident & RCA Management: Define alerting thresholds, reduce alert fatigue, and lead Problem Management/Root Cause Analysis.\\\\r\\\\nTeam Leadership: Mentor engineers and manage global shift rotations for \\\\r\\\\n\\\\r\\\\n continuous monitoring.\\\\r\\\\nService Level Agreements (SLA): Ensure systems meet uptime targets and compliance standards
Skill Requirements
Platform Management: Oversee APM (Application Performance Monitoring) and Infrastructure monitoring stacks. Incident & RCA Management: Define alerting thresholds, reduce alert fatigue, and lead Problem Management/Root Cause Analysis. Team Leadership: Mentor engineers and manage global shift rotations for continuous monitoring. Service Level Agreements (SLA): Ensure systems meet uptime targets and compliance standards
Other Requirements
Platform Management: Oversee APM (Application Performance Monitoring) and Infrastructure monitoring stacks. Incident & RCA Management: Define alerting thresholds, reduce alert fatigue, and lead Problem Management/Root Cause Analysis. Team Leadership: Mentor engineers and manage global shift rotations for continuous monitoring. Service Level Agreements (SLA): Ensure systems meet uptime targets and compliance standards