Job Summary
To play a pivotal role in designing and implementing complex technical solutions, ensuring they align with business objectives and industry best practices.
Role Summary
Ensures reliable, scalable, and cost-efficient deployment and operations of GenAI applications.
Key Responsibilities
- Manage CI/CD pipelines and automated deployments
- Containerization (Docker) and orchestration (Kubernetes)
- Implement AI observability (latency, tokens, errors)
- Optimize infrastructure and LLM cost
- Ensure high availability and incident management
- Manage security (secrets, access controls)
- Continuous monitoring and improvement
Skills
- DevOps / MLOps expertise
- Azure cloud (preferred)
- CI/CD tools (Azure DevOps, GitHub Actions)
- Monitoring and observability tools
Success Metrics
- System uptime
- Cost efficiency
- Deployment frequency
- Incident resolution (MTTR)
Key Responsibilities
2. To spearhead the architecture, design, and development (through a high-performing team) of innovative solutions for product/project & sustenance delivery, ensuring alignment with strategic objectives.
3. To ensure knowledge up-gradation and work with new technologies so that the solution is current and meets quality standards and the client requirements
4. To review architecture and design deliverables and ensure solutions adhere to industry best practices ,architectural standards simultaneously establish and enforce governance /compliance measures.
5. To train and develop team so as to ensure that there is an adequate supply of trained manpower in the said technology and delivery risks are mitigated