Job Summary
Key Responsibilities
- Drive enterprise SRE strategy, standards, and governance
- Establish SLOs, SLIs, Error Budgets, and reliability KPIs
- Deliver reliability outcomes working collaboratively with enterprise teams
- Coach engineering and operations teams on SRE best practices
- Lead observability, resilience, performance engineering, and chaos engineering initiatives
- Promote Infrastructure as Code, CI/CD, and automation-first operations
- Partner with cloud, platform, architecture, and security teams
Requirements
- Expertise in SRE, Production Systems Engineering, Platform Engineering
- Strong expertise in cloud, infrastructure, applications, and observability tools (Dynatrace, Splunk, etc.)
- Experience in incident management, RCA, self-healing systems, and automation
- Strong communication, mentoring, and stakeholder management skills
- Experience across enterprise platforms such as Azure, Java/.NET, SAP, Salesforce, SaaS/COTS, and legacy systems is a plus.
- Good to have experience in Industrial platforms such as Siemens, ABB, AutoCAD
Key Responsibilities
2. To provide solution in business transformation by participating in RFP/RFI; Due Diligence; Pre Sales support and Oversee POC
3. To perform engagement level Delivery governance
4. To mentor technical and solution architects along with self knowledge up-gradation and learning