Job Summary
Client JD
Exciting opportunity for SRE DevOps Engineer to join our Client Servicing and Engagement platform. As SRE DevOps Engineer you’ll be responsible for ensuring our products run reliably, are scalable, and perform optimally in production environments.
You'll supervise and manage these aspects to ensure our products meet the expected Service Level Objectives (SLOs) while also being accountable for one or more areas of the cloud infrastructure resources alongside supervising the work of the SREs in that area. You’ll focus on observability of your technical areas and prioritising the operational service improvements to best improve the SLOs.
What you’ll be doing:
· Maintain, support & improve our cloud infrastructure in a specific technical area
· Investigate, fix and remove service issues with an engineering mindset
· Identify ways in which to improve observability continuously
· Identify toil relentlessly and design automated solutions to remove it
· Prioritise operational service improvements to meet or increase SLO
· Lead incident post-mortems
· working with PO to balance run, change and improve stories in each sprint based on metrics e.g. outage rate increasing
· Support building a strong team by mentoring early career engineers to advance their technical skills, and by undertaking technical interviews to enable us to hire new engineers
Why join us?
We’re transforming at pace. Investing billions in our people, data and tech to change the way we meet the needs of our 28 million customers. We’re growing, and we’d love you to be part of the journey.
What we’re looking for:
We’re looking for an individual with 5+ years’ experience across a broad skillset, capable of applying technical leadership across:
· Kubernetes (AKS/GKE Vital Requirement to understand) and Service Mesh (Istio is currently used)
· CI/CD Automation for Build & Release (Important to have experience in 1 or more tooling solutions)
· Automated Unit/Integration/Load/Performance Testing
· Observability, Logging, Monitoring & Alerting
· Experience programming in at least two (but not all!) of the following languages: Java, Python, Go, C++, JavaScript, TypeScript, PowerShell or Bash/Shell
· Azure technologies such as App Gateway, API Manager, AKS, Cosmos DB, Azure SQL, Azure Firewall
· Proficient in monitoring tools e.g. Azure Monitor/Log Analytics/Dynatrace, Security Cloud & API, Encryption & Certificates.
· Production Kubernetes experience and debugging all services that run within the K8s ecosystem, including Istio service mesh with Envoy.
· Helmfile & Helm Charts writing comprehensive helm charts from scratch.
· SRE mentality (SLI, SLO & SLA) using Observability, Logging, Monitoring & Alerting ( Dynatrace).
· Cloud Infrastructure Provisioning IaC (Infrastructure as service) maintaining and support.
· Cluster Management.
· Harness Tool Experience in Templates creation.
· Dynatrace APM Tool Experience.
· Azure DevOps
· Toil Reduction and Automation.
· FinOps Experience on Cloud Platform.
· Knowledge of Azure and GCP Cloud Platform Operations.
Key Responsibilities
2. Collaborate with cross functional teams to ensure seamless integration of devops practices throughout the software development lifecycle.
3. Develop and maintain ci/cd pipelines to automate build, test, and deployment processes.
4. Monitor and optimize system performance, reliability, and security in a cloud environment.
5. Troubleshoot technical issues related to devops tools and provide timely resolution.
6. Stay updated on industry trends and best practices in devops, kubernetes, and azure technologies.
7. Train and mentor junior team members on devops principles and best practices.
Skill Requirements
2. Proficiency in using kubernetes for container orchestration.
3. Handson experience with azure devops for continuous integration and continuous deployment.
4. Strong understanding of cloud computing concepts and technologies.
5. Familiarity with scripting languages like python, shell, or powershell.
6. Excellent problem-solving and analytical skills.
7. Effective communication and collaboration abilities.
8. Ability to work in a fast paced and dynamic environment.
Other Requirements
Exciting opportunity for SRE DevOps Engineer to join our Client Servicing and Engagement platform. As SRE DevOps Engineer you’ll be responsible for ensuring our products run reliably, are scalable, and perform optimally in production environments.
You'll supervise and manage these aspects to ensure our products meet the expected Service Level Objectives (SLOs) while also being accountable for one or more areas of the cloud infrastructure resources alongside supervising the work of the SREs in that area. You’ll focus on observability of your technical areas and prioritising the operational service improvements to best improve the SLOs.
What you’ll be doing:
· Maintain, support & improve our cloud infrastructure in a specific technical area
· Investigate, fix and remove service issues with an engineering mindset
· Identify ways in which to improve observability continuously
· Identify toil relentlessly and design automated solutions to remove it
· Prioritise operational service improvements to meet or increase SLO
· Lead incident post-mortems
· working with PO to balance run, change and improve stories in each sprint based on metrics e.g. outage rate increasing
· Support building a strong team by mentoring early career engineers to advance their technical skills, and by undertaking technical interviews to enable us to hire new engineers
Why join us?
We’re transforming at pace. Investing billions in our people, data and tech to change the way we meet the needs of our 28 million customers. We’re growing, and we’d love you to be part of the journey.
What we’re looking for:
We’re looking for an individual with 5+ years’ experience across a broad skillset, capable of applying technical leadership across:
· Kubernetes (AKS/GKE Vital Requirement to understand) and Service Mesh (Istio is currently used)
· CI/CD Automation for Build & Release (Important to have experience in 1 or more tooling solutions)
· Automated Unit/Integration/Load/Performance Testing
· Observability, Logging, Monitoring & Alerting
· Experience programming in at least two (but not all!) of the following languages: Java, Python, Go, C++, JavaScript, TypeScript, PowerShell or Bash/Shell
· Azure technologies such as App Gateway, API Manager, AKS, Cosmos DB, Azure SQL, Azure Firewall
· Proficient in monitoring tools e.g. Azure Monitor/Log Analytics/Dynatrace, Security Cloud & API, Encryption & Certificates.
· Production Kubernetes experience and debugging all services that run within the K8s ecosystem, including Istio service mesh with Envoy.
· Helmfile & Helm Charts writing comprehensive helm charts from scratch.
· SRE mentality (SLI, SLO & SLA) using Observability, Logging, Monitoring & Alerting ( Dynatrace).
· Cloud Infrastructure Provisioning IaC (Infrastructure as service) maintaining and support.
· Cluster Management.
· Harness Tool Experience in Templates creation.
· Dynatrace APM Tool Experience.
· Azure DevOps
· Toil Reduction and Automation.
· FinOps Experience on Cloud Platform.
· Knowledge of Azure and GCP Cloud Platform Operations.