Job Summary
We are looking for a Kubernetes Infrastructure Engineer responsible for designing, building, and managing the underlying infrastructure that supports Kubernetes clusters. This role focuses on cluster provisioning, networking, security, and automation across cloud environments.
Key Responsibilities
- Design and provision Kubernetes clusters (AKS/EKS/GKE) using Infrastructure as Code (Terraform/ARM/Bicep).
- Manage cluster lifecycle: creation, scaling, upgrades, and decommissioning.
- Configure and manage core infrastructure components:
  - Node pools
  - Virtual networks (VNet/VPC)
  - Subnets, NSGs, route tables
- Implement and manage private clusters, private endpoints, and secure access patterns.
- Build and maintain cluster networking:
  - Ingress controllers (AGIC/Nginx)
  - Load balancers (L4/L7)
  - DNS configuration and resolution
- Integrate Kubernetes with cloud-native services:
  - Azure Container Registry (ACR)
  - Key Vault / Secrets Manager
  - Monitoring (Azure Monitor, Log Analytics)
- Implement identity and access management:
  - RBAC
  - Azure AD integration / Workload Identity / Managed Identity
- Ensure infrastructure security best practices:
  - Network policies
  - Pod security standards
  - Image governance
- Automate infrastructure deployment and management via CI/CD pipelines.
- Troubleshoot infra-level issues:
  - Node failures
  - Networking/DNS issues
  - Image pull failures (ACR access)
- Optimize cluster performance, cost, and resource utilization.
- Maintain high availability and disaster recovery strategies.
- Document infrastructure architecture and SOPs.

Day-to-Day Activities:
- Provision and update AKS clusters using Terraform.
- Manage networking, DNS, and connectivity across environments.
- Troubleshoot infra-level cluster issues (node pool, VNet, ACR access).
- Support application teams with environment readiness.
- Monitor cluster performance and optimize resources.
- Implement security controls and access policies.
Skill Requirements
Required Skills & Qualifications:
- Strong experience in Kubernetes infrastructure and cluster architecture.
- Hands-on experience with AKS (preferred) or EKS/GKE.
- Expertise in Infrastructure as Code:
  - Terraform (mandatory/preferred)
- Strong knowledge of cloud networking:
  - VNets, Subnets, Peering
  - Private Endpoint, Private DNS
- Experience with container registries (ACR).
- Understanding of Kubernetes internals:
  - Control plane vs worker nodes
  - Scheduling and scaling
- Experience with CI/CD pipelines for infra provisioning.
- Strong troubleshooting skills in multi-layer environments (network + cluster).

Preferred Skills:
- Experience with:
  - Application Gateway Ingress Controller (AGIC)
  - Azure Workload Identity / Federated Identity
- Knowledge of:
  - Service Mesh (Istio)
  - GitOps (ArgoCD, Flux)
- Experience managing private AKS clusters.
- Exposure to security and compliance frameworks.
- Certifications:
  - CKA / CKAD
  - Azure Administrator / Azure DevOps Engineer
Other Requirements
NA