Administrator - Azure DevOps, Terraform
Canada
Job Description
Administrator - Azure DevOps, Terraform
Mississauga, Ontario

Job Summary

SRE/DevOps Engineer responsible for ensuring system reliability, scalability, and performance by combining software engineering with operations, automation, and continuous delivery practices. Key Responsibilities Design and manage highly available, scalable, and reliable systems Implement and maintain CI/CD pipelines for faster and stable releases Monitor system health using observability tools (metrics, logs, traces) Define and manage SLIs, SLOs, and SLAs Automate infrastructure using Infrastructure as Code (IaC) Perform incident management, root cause analysis (RCA), and problem resolution Optimize system performance, cost, and capacity planning Ensure system security, compliance, and resilience Collaborate with development teams to improve system design and reliability Drive automation to reduce manual intervention and improve efficiency Required Skills Strong experience with DevOps tools (Azure DevOps, Jenkins, GitLab CI/CD) Expertise in cloud platforms (Azure/AWS/GCP) Knowledge of containerization (Docker) and orchestration (Kubernetes) Experience in monitoring & logging tools (Prometheus, Grafana, ELK, Azure Monitor, Splunk) Proficiency in scripting (Python, Bash, PowerShell) Hands-on with Terraform, Ansible, or ARM templates Understanding of networking, OS (Linux), and distributed systems Experience in incident response and production support

Key Responsibilities

SRE/DevOps Engineer responsible for ensuring system reliability, scalability, and performance by combining software engineering with operations, automation, and continuous delivery practices. Key Responsibilities Design and manage highly available, scalable, and reliable systems Implement and maintain CI/CD pipelines for faster and stable releases Monitor system health using observability tools (metrics, logs, traces) Define and manage SLIs, SLOs, and SLAs Automate infrastructure using Infrastructure as Code (IaC) Perform incident management, root cause analysis (RCA), and problem resolution Optimize system performance, cost, and capacity planning Ensure system security, compliance, and resilience Collaborate with development teams to improve system design and reliability Drive automation to reduce manual intervention and improve efficiency Required Skills Strong experience with DevOps tools (Azure DevOps, Jenkins, GitLab CI/CD) Expertise in cloud platforms (Azure/AWS/GCP) Knowledge of containerization (Docker) and orchestration (Kubernetes) Experience in monitoring & logging tools (Prometheus, Grafana, ELK, Azure Monitor, Splunk) Proficiency in scripting (Python, Bash, PowerShell) Hands-on with Terraform, Ansible, or ARM templates Understanding of networking, OS (Linux), and distributed systems Experience in incident response and production support

Skill Requirements

SRE/DevOps Engineer responsible for ensuring system reliability, scalability, and performance by combining software engineering with operations, automation, and continuous delivery practices. Key Responsibilities Design and manage highly available, scalable, and reliable systems Implement and maintain CI/CD pipelines for faster and stable releases Monitor system health using observability tools (metrics, logs, traces) Define and manage SLIs, SLOs, and SLAs Automate infrastructure using Infrastructure as Code (IaC) Perform incident management, root cause analysis (RCA), and problem resolution Optimize system performance, cost, and capacity planning Ensure system security, compliance, and resilience Collaborate with development teams to improve system design and reliability Drive automation to reduce manual intervention and improve efficiency Required Skills Strong experience with DevOps tools (Azure DevOps, Jenkins, GitLab CI/CD) Expertise in cloud platforms (Azure/AWS/GCP) Knowledge of containerization (Docker) and orchestration (Kubernetes) Experience in monitoring & logging tools (Prometheus, Grafana, ELK, Azure Monitor, Splunk) Proficiency in scripting (Python, Bash, PowerShell) Hands-on with Terraform, Ansible, or ARM templates Understanding of networking, OS (Linux), and distributed systems Experience in incident response and production support

Other Requirements

SRE/DevOps Engineer responsible for ensuring system reliability, scalability, and performance by combining software engineering with operations, automation, and continuous delivery practices. Key Responsibilities Design and manage highly available, scalable, and reliable systems Implement and maintain CI/CD pipelines for faster and stable releases Monitor system health using observability tools (metrics, logs, traces) Define and manage SLIs, SLOs, and SLAs Automate infrastructure using Infrastructure as Code (IaC) Perform incident management, root cause analysis (RCA), and problem resolution Optimize system performance, cost, and capacity planning Ensure system security, compliance, and resilience Collaborate with development teams to improve system design and reliability Drive automation to reduce manual intervention and improve efficiency Required Skills Strong experience with DevOps tools (Azure DevOps, Jenkins, GitLab CI/CD) Expertise in cloud platforms (Azure/AWS/GCP) Knowledge of containerization (Docker) and orchestration (Kubernetes) Experience in monitoring & logging tools (Prometheus, Grafana, ELK, Azure Monitor, Splunk) Proficiency in scripting (Python, Bash, PowerShell) Hands-on with Terraform, Ansible, or ARM templates Understanding of networking, OS (Linux), and distributed systems Experience in incident response and production support

Information at a Glance

Why HCLTech?

At HCLTech, you'll supercharge your potential. You'll find your career. And you'll find your spark. All at a place that knows that helping its customers stay on top starts by putting its people first.

HCLTech is a global technology company, home to more than 226,300 people across 60 countries, delivering industry-leading capabilities centered around digital, engineering, cloud and AI, powered by a broad portfolio of technology services and products. We work with clients across all major verticals, providing industry solutions for Financial Services, Manufacturing, Life Sciences and Healthcare, Technology and Services, Telecom and Media, Retail and CPG, and Public Services. Consolidated revenues as of 12 months ending December 2025 totaled $14.5 billion.

23 Benefits At HCLTech, we believe in empowering our employees with comprehensive benefits that support their professional growth and enhance their well-being. When you sign up for a career with us, you gain access to: https://rmkcdn.successfactors.com/147eb21f/a701dca9-f32d-4fc9-9447-6.svg Industry-benchmarked compensation https://rmkcdn.successfactors.com/147eb21f/b0c54381-ddcc-4a33-9b35-9.svg Best-in-class healthcare benefits https://rmkcdn.successfactors.com/147eb21f/b73027be-7aae-4d36-a090-4.svg Personal time off https://rmkcdn.successfactors.com/147eb21f/d5b4fdfd-2e99-4e26-9878-9.svg Maternity and paternity benefits https://rmkcdn.successfactors.com/147eb21f/3d42b0fc-4652-435a-9ece-c.svg Access to skills / higher education programs/resources https://rmkcdn.successfactors.com/147eb21f/aeddeaf2-9e25-4584-ad11-d.svg Discounts on products and services via Benefit Box https://rmkcdn.successfactors.com/147eb21f/a9609a3b-2700-4b3c-9d90-a.svg Participate in CSR programs and live life with a purpose https://rmkcdn.successfactors.com/147eb21f/c6e33851-710f-4634-bd69-f.svg Opportunities to grow and advance your career Note: The benefits listed above vary depending on the nature of your employment and the country where you work. Some benefits may be available in some countries but not in all.