Job Summary
Event Monitoring and Management - Job Description Role Summary The Event Monitoring and Management Analyst is responsible for real-time monitoring, correlation, and management of IT events across infrastructure, applications, and services. The role ensures early detection of incidents, minimizes service disruptions, and supports proactive incident and problem management. Key Responsibilities Monitor IT infrastructure, applications, and services using event monitoring tools (e.g., Splunk, Dynatrace, SCOM, Grafana) Analyze, correlate, and prioritize events to identify potential incidents Perform event triage and categorize alerts based on severity and business impact Trigger incident tickets in ITSM tools (ServiceNow, Remedy) as per defined processes Ensure timely response and escalation in line with SLAs Reduce alert noise through event correlation, suppression, and tuning Collaborate with support teams (network, server, application) for issue resolution Support root cause analysis (RCA) and problem management activities Maintain event monitoring dashboards and reporting metrics Ensure compliance with ITIL Event Management practices Required Skills Strong understanding of IT infrastructure (network, server, cloud, applications) Hands-on experience with monitoring and observability tools Knowledge of ITIL framework (Event, Incident, Problem Management) Experience with ticketing tools (ServiceNow, Remedy) Ability to analyze logs, alerts, and system behavior Good analytical and troubleshooting skills Strong communication and documentation skills Preferred Qualifications Bachelor’s degree in IT, Computer Science, or related field ITIL certification (Foundation or above) Experience in 24x7 NOC/SOC or Event Monitoring environment Exposure to automation and scripting (PowerShell, Python) is an advantage Key KPIs Mean Time to Detect (MTTD) Mean Time to Acknowledge (MTTA) Mean Time to Resolve (MTTR) Event-to-Incident conversion accuracy Reduction in false positives / alert noise SLA compliance for event response Tools & Technologies Monitoring: Splunk, Dynatrace, Nagios, Zabbix, SolarWinds ITSM: ServiceNow, Remedy Logging & Observability: ELK Stack, Grafana Collaboration Tools: MS Teams, Email, Incident bridges
Key Responsibilities
Event Monitoring and Management - Job Description Role Summary The Event Monitoring and Management Analyst is responsible for real-time monitoring, correlation, and management of IT events across infrastructure, applications, and services. The role ensures early detection of incidents, minimizes service disruptions, and supports proactive incident and problem management. Key Responsibilities Monitor IT infrastructure, applications, and services using event monitoring tools (e.g., Splunk, Dynatrace, SCOM, Grafana) Analyze, correlate, and prioritize events to identify potential incidents Perform event triage and categorize alerts based on severity and business impact Trigger incident tickets in ITSM tools (ServiceNow, Remedy) as per defined processes Ensure timely response and escalation in line with SLAs Reduce alert noise through event correlation, suppression, and tuning Collaborate with support teams (network, server, application) for issue resolution Support root cause analysis (RCA) and problem management activities Maintain event monitoring dashboards and reporting metrics Ensure compliance with ITIL Event Management practices Required Skills Strong understanding of IT infrastructure (network, server, cloud, applications) Hands-on experience with monitoring and observability tools Knowledge of ITIL framework (Event, Incident, Problem Management) Experience with ticketing tools (ServiceNow, Remedy) Ability to analyze logs, alerts, and system behavior Good analytical and troubleshooting skills Strong communication and documentation skills Preferred Qualifications Bachelor’s degree in IT, Computer Science, or related field ITIL certification (Foundation or above) Experience in 24x7 NOC/SOC or Event Monitoring environment Exposure to automation and scripting (PowerShell, Python) is an advantage Key KPIs Mean Time to Detect (MTTD) Mean Time to Acknowledge (MTTA) Mean Time to Resolve (MTTR) Event-to-Incident conversion accuracy Reduction in false positives / alert noise SLA compliance for event response Tools & Technologies Monitoring: Splunk, Dynatrace, Nagios, Zabbix, SolarWinds ITSM: ServiceNow, Remedy Logging & Observability: ELK Stack, Grafana Collaboration Tools: MS Teams, Email, Incident bridges
Skill Requirements
Event Monitoring and Management - Job Description Role Summary The Event Monitoring and Management Analyst is responsible for real-time monitoring, correlation, and management of IT events across infrastructure, applications, and services. The role ensures early detection of incidents, minimizes service disruptions, and supports proactive incident and problem management. Key Responsibilities Monitor IT infrastructure, applications, and services using event monitoring tools (e.g., Splunk, Dynatrace, SCOM, Grafana) Analyze, correlate, and prioritize events to identify potential incidents Perform event triage and categorize alerts based on severity and business impact Trigger incident tickets in ITSM tools (ServiceNow, Remedy) as per defined processes Ensure timely response and escalation in line with SLAs Reduce alert noise through event correlation, suppression, and tuning Collaborate with support teams (network, server, application) for issue resolution Support root cause analysis (RCA) and problem management activities Maintain event monitoring dashboards and reporting metrics Ensure compliance with ITIL Event Management practices Required Skills Strong understanding of IT infrastructure (network, server, cloud, applications) Hands-on experience with monitoring and observability tools Knowledge of ITIL framework (Event, Incident, Problem Management) Experience with ticketing tools (ServiceNow, Remedy) Ability to analyze logs, alerts, and system behavior Good analytical and troubleshooting skills Strong communication and documentation skills Preferred Qualifications Bachelor’s degree in IT, Computer Science, or related field ITIL certification (Foundation or above) Experience in 24x7 NOC/SOC or Event Monitoring environment Exposure to automation and scripting (PowerShell, Python) is an advantage Key KPIs Mean Time to Detect (MTTD) Mean Time to Acknowledge (MTTA) Mean Time to Resolve (MTTR) Event-to-Incident conversion accuracy Reduction in false positives / alert noise SLA compliance for event response Tools & Technologies Monitoring: Splunk, Dynatrace, Nagios, Zabbix, SolarWinds ITSM: ServiceNow, Remedy Logging & Observability: ELK Stack, Grafana Collaboration Tools: MS Teams, Email, Incident bridges
Other Requirements
Event Monitoring and Management - Job Description Role Summary The Event Monitoring and Management Analyst is responsible for real-time monitoring, correlation, and management of IT events across infrastructure, applications, and services. The role ensures early detection of incidents, minimizes service disruptions, and supports proactive incident and problem management. Key Responsibilities Monitor IT infrastructure, applications, and services using event monitoring tools (e.g., Splunk, Dynatrace, SCOM, Grafana) Analyze, correlate, and prioritize events to identify potential incidents Perform event triage and categorize alerts based on severity and business impact Trigger incident tickets in ITSM tools (ServiceNow, Remedy) as per defined processes Ensure timely response and escalation in line with SLAs Reduce alert noise through event correlation, suppression, and tuning Collaborate with support teams (network, server, application) for issue resolution Support root cause analysis (RCA) and problem management activities Maintain event monitoring dashboards and reporting metrics Ensure compliance with ITIL Event Management practices Required Skills Strong understanding of IT infrastructure (network, server, cloud, applications) Hands-on experience with monitoring and observability tools Knowledge of ITIL framework (Event, Incident, Problem Management) Experience with ticketing tools (ServiceNow, Remedy) Ability to analyze logs, alerts, and system behavior Good analytical and troubleshooting skills Strong communication and documentation skills Preferred Qualifications Bachelor’s degree in IT, Computer Science, or related field ITIL certification (Foundation or above) Experience in 24x7 NOC/SOC or Event Monitoring environment Exposure to automation and scripting (PowerShell, Python) is an advantage Key KPIs Mean Time to Detect (MTTD) Mean Time to Acknowledge (MTTA) Mean Time to Resolve (MTTR) Event-to-Incident conversion accuracy Reduction in false positives / alert noise SLA compliance for event response Tools & Technologies Monitoring: Splunk, Dynatrace, Nagios, Zabbix, SolarWinds ITSM: ServiceNow, Remedy Logging & Observability: ELK Stack, Grafana Collaboration Tools: MS Teams, Email, Incident bridges