Job Summary
Key Responsibilities
2. Optimize batch job monitoring processes with tools like Control-M and Autosys, implementing advanced scheduling and alerting strategies to minimize downtime and SLA breaches.
3. Drive continuous improvement initiatives in monitoring workflows and escalation procedures, leveraging ITIL frameworks and automation platforms to enhance operational efficiency.
4. Guide and mentor the monitoring team in best practices for event correlation, incident triage, and root cause analysis using platforms such as ServiceNow and BMC Remedy.
5. Collaborate with stakeholders to align monitoring solutions with evolving client requirements, delivering tailored dashboards and reporting via tools like Grafana and Kibana.
6. Innovate monitoring processes by evaluating and integrating emerging technologies, ensuring the command center remains at the forefront of operational excellence.
7. Ensure compliance with security and governance standards in all monitoring and event management activities, utilizing SIEM solutions where appropriate.
Skill Requirements
2. Advanced proficiency in automation scripting (Python, PowerShell, shell) for monitoring optimization and workflow automation.
3. Excellent ability to design, implement, and optimize dashboards and reporting using Grafana, Kibana, or similar tools.
4. Strong expertise in root cause analysis, event correlation, and escalation management using ServiceNow or BMC Remedy.
5. Excellent leadership and mentoring skills for guiding technical teams in high-pressure operational settings.
6. Advanced proficiency in aligning monitoring solutions with business objectives and client SLAs.
Other Requirements
2. Certified in IBM Netcool, Splunk, or equivalent monitoring platforms (optional but valuable)
3. Control-M or Autosys certification (optional but valuable)