Subject Matter Expert (Support&Ops)
India
Job Description
Subject Matter Expert (Support&Ops)
Sholinganallur, Tamil Nadu

Job Summary

Key Skills & Requirements

  • Strong hands-on experience in Grafana administration, including dashboard development, alert configuration, notification policies, RBAC, user management, and data source integration.
  • Expertise in Grafana plugin installation, configuration, troubleshooting, upgrades, and performance optimization across enterprise-scale monitoring environments.
  • Experience designing and maintaining observability solutions using Grafana Alloy, Grafana and OpenTelemetry frameworks.
  • Hands-on experience with Grafana Alloy configuration, telemetry collection pipelines, log/metric forwarding, relabeling, filtering, and performance tuning.
  • Strong knowledge of BindPlane administration, including collector deployment, gateway configuration, telemetry routing, load balancing, high availability, and troubleshooting.
  • Experience configuring and optimizing telemetry ingestion pipelines from on-premises and cloud-based infrastructure into centralized observability platforms.
  • Good understanding of Google Cloud Platform (GCP) services, with hands-on experience in GKE cluster administration, workload deployment, pod management, scaling, and troubleshooting.
  • Experience using Google Cloud Monitoring tools such as Metrics Explorer, Logs Explorer, dashboards, alerting policies, and observability best practices.
  • Strong Kubernetes administration skills, including deployments, services, ingress controllers, daemonsets, statefulsets, namespaces, resource management, and cluster troubleshooting.
  • Experience managing and monitoring Azure Kubernetes Service (AKS) environments and implementing observability solutions for containerized workloads.
  • Knowledge of Azure cloud services, networking concepts, identity management, and infrastructure monitoring.
  • Hands-on experience with Ansible for infrastructure automation, configuration management, deployment automation, and operational tasks.
  • Strong scripting and automation skills using Python and Shell Scripting for monitoring,  API integrations, and operational efficiency improvements.
  • Experience integrating monitoring platforms with ServiceNow, REST APIs, webhook-based alerting, SQL , and third-party enterprise applications.
  • Strong understanding of Linux system administration, troubleshooting, process management, networking fundamentals, and performance analysis.
  • Ability to perform root cause analysis, capacity planning, performance optimization, and reliability improvements for large-scale monitoring platforms.
  • Experience supporting enterprise observability environments with thousands of monitored servers, applications, and cloud-native workloads.
  • Excellent analytical, troubleshooting, documentation, and stakeholder communication skills. 

Cloud & Container Technologies

  • Google Cloud Platform (GCP)/Google Kubernetes Engine (GKE)
  • Kubernetes Administration
  • Azure Cloud/Azure Kubernetes Service (AKS)

Monitoring & Observability

  • Grafana
  • Grafana Alloy
  • OpenTelemetry
  • BindPlane
  • Cloud Monitoring
  • Log Management Solutions
  • Prometheus

Automation & Development

  • Python
  • Shell Scripting (Bash)
  • Ansible
  • REST APIs
  • Git/GitHub 

 

Key Responsibilities

Key Skills & Requirements

  • Strong hands-on experience in Grafana administration, including dashboard development, alert configuration, notification policies, RBAC, user management, and data source integration.
  • Expertise in Grafana plugin installation, configuration, troubleshooting, upgrades, and performance optimization across enterprise-scale monitoring environments.
  • Experience designing and maintaining observability solutions using Grafana Alloy, Grafana and OpenTelemetry frameworks.
  • Hands-on experience with Grafana Alloy configuration, telemetry collection pipelines, log/metric forwarding, relabeling, filtering, and performance tuning.
  • Strong knowledge of BindPlane administration, including collector deployment, gateway configuration, telemetry routing, load balancing, high availability, and troubleshooting.
  • Experience configuring and optimizing telemetry ingestion pipelines from on-premises and cloud-based infrastructure into centralized observability platforms.
  • Good understanding of Google Cloud Platform (GCP) services, with hands-on experience in GKE cluster administration, workload deployment, pod management, scaling, and troubleshooting.
  • Experience using Google Cloud Monitoring tools such as Metrics Explorer, Logs Explorer, dashboards, alerting policies, and observability best practices.
  • Strong Kubernetes administration skills, including deployments, services, ingress controllers, daemonsets, statefulsets, namespaces, resource management, and cluster troubleshooting.
  • Experience managing and monitoring Azure Kubernetes Service (AKS) environments and implementing observability solutions for containerized workloads.
  • Knowledge of Azure cloud services, networking concepts, identity management, and infrastructure monitoring.
  • Hands-on experience with Ansible for infrastructure automation, configuration management, deployment automation, and operational tasks.
  • Strong scripting and automation skills using Python and Shell Scripting for monitoring,  API integrations, and operational efficiency improvements.
  • Experience integrating monitoring platforms with ServiceNow, REST APIs, webhook-based alerting, SQL , and third-party enterprise applications.
  • Strong understanding of Linux system administration, troubleshooting, process management, networking fundamentals, and performance analysis.
  • Ability to perform root cause analysis, capacity planning, performance optimization, and reliability improvements for large-scale monitoring platforms.
  • Experience supporting enterprise observability environments with thousands of monitored servers, applications, and cloud-native workloads.
  • Excellent analytical, troubleshooting, documentation, and stakeholder communication skills. 

Cloud & Container Technologies

  • Google Cloud Platform (GCP)/Google Kubernetes Engine (GKE)
  • Kubernetes Administration
  • Azure Cloud/Azure Kubernetes Service (AKS)

Monitoring & Observability

  • Grafana
  • Grafana Alloy
  • OpenTelemetry
  • BindPlane
  • Cloud Monitoring
  • Log Management Solutions
  • Prometheus

Automation & Development

  • Python
  • Shell Scripting (Bash)
  • Ansible
  • REST APIs
  • Git/GitHub 

 

Skill Requirements

null

Other Requirements

null
Information at a Glance

Why HCLTech?

At HCLTech, you'll supercharge your potential. You'll find your career. And you'll find your spark. All at a place that knows that helping its customers stay on top starts by putting its people first.

HCLTech is a global technology company, home to more than 226,300 people across 60 countries, delivering industry-leading capabilities centered around digital, engineering, cloud and AI, powered by a broad portfolio of technology services and products. We work with clients across all major verticals, providing industry solutions for Financial Services, Manufacturing, Life Sciences and Healthcare, Technology and Services, Telecom and Media, Retail and CPG, and Public Services. Consolidated revenues as of 12 months ending December 2025 totaled $14.5 billion.

23 Benefits At HCLTech, we believe in empowering our employees with comprehensive benefits that support their professional growth and enhance their well-being. When you sign up for a career with us, you gain access to: https://rmkcdn.successfactors.com/147eb21f/a701dca9-f32d-4fc9-9447-6.svg Industry-benchmarked compensation https://rmkcdn.successfactors.com/147eb21f/b0c54381-ddcc-4a33-9b35-9.svg Best-in-class healthcare benefits https://rmkcdn.successfactors.com/147eb21f/b73027be-7aae-4d36-a090-4.svg Personal time off https://rmkcdn.successfactors.com/147eb21f/d5b4fdfd-2e99-4e26-9878-9.svg Maternity and paternity benefits https://rmkcdn.successfactors.com/147eb21f/3d42b0fc-4652-435a-9ece-c.svg Access to skills / higher education programs/resources https://rmkcdn.successfactors.com/147eb21f/aeddeaf2-9e25-4584-ad11-d.svg Discounts on products and services via Benefit Box https://rmkcdn.successfactors.com/147eb21f/a9609a3b-2700-4b3c-9d90-a.svg Participate in CSR programs and live life with a purpose https://rmkcdn.successfactors.com/147eb21f/c6e33851-710f-4634-bd69-f.svg Opportunities to grow and advance your career Note: The benefits listed above vary depending on the nature of your employment and the country where you work. Some benefits may be available in some countries but not in all.