SME - Red Hat Cluster, Ansible, Kubernetes, Microsoft Azure
India
Job Description
Bengaluru, Karnataka

Job Summary

As a Subject Matter Expert in Support & Operations, you will play a pivotal role in ensuring the timely resolution of escalated incidents while adhering to quality norms and service level agreements (SLAs). Your expertise in Kubernetes, Ansible, and cloud technologies will be essential in driving customer satisfaction and operational excellence.

Key Responsibilities

==========================================================================================================================

Job Title: Platform Site Reliability Engineer (SRE) – OpenShift (8+ Years)

 

Location

Bangalore / Kolkata / Pune

 

Band

L2

 

Role Summary

We are looking for a Platform SRE (8+ years) to engineer, run, and continuously improve an OpenShift-heavy container platform. This role combines Day‑1 responsibilities (platform setup, standardization, onboarding enablement) with Day‑2 operations (stability, upgrades, performance, incident management, and automation).

 

Key Responsibilities

 

Day‑1 (Build / Enablement)

- Support OpenShift platform onboarding: cluster setup assistance, baseline configurations, and environment readiness.

- Implement platform standards: namespaces/projects, RBAC/SCC, resource quotas/limits, routes/ingress patterns, and operator enablement.

- Create reusable deployment patterns using Helm (standard charts/templates, values structure, versioning).

- Build and standardize GitLab CI templates/pipelines for build-test-deploy and environment promotion.

- Develop automation using Ansible to enable repeatable provisioning/configuration workflows.

 

Day‑2 (Run / Operate / Optimize)

- Own cluster health and reliability: monitoring, capacity planning, scaling, patching and upgrades, and performance troubleshooting.

- Troubleshoot issues across OpenShift components, nodes, networking/storage basics, and workload behaviour.

- Participate in incident response: triage, mitigation, RCA, post-incident actions, and runbook/SOP improvements.

- Reduce operational toil through automation, improved alerts, and self-service enablement for application teams.

- Collaborate with stakeholders to improve security posture and operational governance (access controls, platform hygiene).

 

Mandatory Skills

- 8+ years’ experience in SRE / DevOps / Platform / Infrastructure Engineering

- Strong hands-on OpenShift Administration

- Helm (deployments + chart maintenance; chart authoring preferred)

- Ansible (playbooks/roles; automation mindset)

- Linux fundamentals (logs, processes, system services, basic networking)

- CI/CD with GitLab CI (pipelines, runners, templates, variables/secrets)

 

Good-to-Have Skills

- ArgoCD (GitOps)

- vSphere, NSX, VMware Cloud Foundation (VCF)

- Exposure to observability stacks (Prometheus/Grafana, ELK/EFK, Splunk, Datadog, etc.)

 

Traits We Value

- Strong troubleshooting, ownership, and production support mindset

- Comfortable operating in structured on-call rotations and handling high-severity incidents

- Good documentation habits (runbooks, SOPs, RCA notes)

=================================

Skill Requirements

1. Proficient in Kubernetes and container management.
2. Strong understanding of Ansible for automation and configuration management.
3. Familiarity with Red Hat Linux and Red Hat cluster technologies.
4. Knowledge of Azure cloud services and their integration with Kubernetes.
5. Excellent analytical and problem-solving skills, with a focus on customer satisfaction and operational efficiency.

Other Requirements

1. Relevant certifications such as Certified Kubernetes Administrator (CKA) or Red Hat Certified Engineer (RHCE) are optional but valuable.

Why HCLTech?

At HCLTech, you'll supercharge your potential. You'll find your career. And you'll find your spark. All at a place that knows that helping its customers stay on top starts by putting its people first.

HCLTech is a global technology company, home to more than 226,300 people across 60 countries, delivering industry-leading capabilities centered around digital, engineering, cloud and AI, powered by a broad portfolio of technology services and products. We work with clients across all major verticals, providing industry solutions for Financial Services, Manufacturing, Life Sciences and Healthcare, Technology and Services, Telecom and Media, Retail and CPG, and Public Services. Consolidated revenues as of 12 months ending December 2025 totaled $14.5 billion.

Benefits

At HCLTech, we believe in empowering our employees with comprehensive benefits that support their professional growth and enhance their well-being. When you sign up for a career with us, you gain access to:

- Industry-benchmarked compensation

- Best-in-class healthcare benefits

- Personal time off

- Maternity and paternity benefits

- Access to skills and higher education programs/resources

- Discounts on products and services via Benefit Box

- Participation in CSR programs to live life with a purpose

- Opportunities to grow and advance your career

Note: The benefits listed above vary depending on the nature of your employment and the country where you work. Some benefits may be available in some countries but not in all.