Job Summary
- Manage AWS PaaS and Kubernetes environments (EKS/ROSA/OpenShift)
- Ensure platform availability, performance, and reliability
- Perform incident management, troubleshooting, and RCA for critical issues
- Implement Infrastructure as Code using Terraform and AWS CloudFormation
- Automate provisioning and operational tasks
- Manage monitoring and observability using Elastic Stack and Grafana
- Define and manage SLOs, SLIs, and SLAs
- Perform capacity planning and performance optimization
- Support change, problem, and release management processes
- Collaborate with cloud, network, security, and application teams
Key Responsibilities
Responsible for ensuring the reliability, scalability, and performance of AWS PaaS and Kubernetes-based platforms, along with automation and monitoring aligned to a managed services model.
Skill Requirements
Technical Skills:
- Kubernetes (EKS/ROSA/OpenShift)
- AWS Cloud (EC2, VPC, Load Balancer, Storage)
- Terraform
- AWS CloudFormation
- Elastic Stack (ELK), Grafana
- Linux administration
- Incident and problem management

Preferred Skills:
- Ansible automation
- ServiceNow ITSM
- CI/CD pipelines
Other Requirements
- AWS Certified Solutions Architect / DevOps
- Certified Kubernetes Administrator (CKA)
- Terraform Associate