Job Summary
Enterprise Observability & Monitoring: Manage Elastic, Grafana dashboards, alerts, and monitoring coverage. 2. Vulnerability Management (Qualys): Track vulnerabilities, coordinate remediation, ensure SLA adherence. 3. Incident & Problem Management: Lead P1/P2 incidents, perform RCA, trend analysis. 4. ICC / Command Center Support: Monitor alerts, escalate incidents, support 24x7 monitoring ecosystem. 5. Automation & Integration: Implement monitoring automation using Terraform/Ansible. 6. Performance Monitoring: Ensure system health, capacity, and performance optimization. 7. Governance & Reporting: Maintain dashboards, reports, SOPs, and knowledge base.
Key Responsibilities
Provide L3 support, engineering oversight, and optimization for enterprise observability and vulnerability platforms (Qualys, Elastic, Grafana, etc.). Ensure proactive monitoring, incident resolution, and SLA compliance across global environments supporting ICC, SOC, and hybrid infrastructure.
Skill Requirements
Qualys (VMDR, vulnerability management) - Elastic Stack (ELK) for logs and observability - Grafana dashboarding and alerting - Monitoring tools (Datadog/SolarWinds) - Cloud and Infra Monitoring (AWS, Linux, Windows) - Scripting (Python/Shell)
Other Requirements
incident, Problem, Change Management, and SRE practices.