About the Role
We are looking for a highly experienced Senior DevOps Engineer with strong depth in Azure, Kubernetes (AKS), and Infrastructure-as-Code to become the senior technical lead within our platform team. You will bring the hands-on expertise needed to evolve our cloud-native infrastructure, CI/CD pipelines, and observability stack.
This role suits a self-starter with strong communication skills and the ability to mentor others, set direction, and introduce best practices.
Key Responsibilities
* Lead and own the design, operation, and improvement of our AKS platform.
* Develop and maintain Terraform/Terragrunt IaC modules and structure.
* Standardise and improve GitOps workflows using ArgoCD.
* Build and optimise CI/CD pipelines using Azure DevOps or Jenkins.
* Enhance and maintain observability using Prometheus, Grafana, Loki, Tempo, and OpenTelemetry.
* Use Ansible to automate configuration management and multi-server upgrades, on Windows and Linux.
* Mentor and support existing DevOps engineers, developing depth in our DevOps practices.
* Collaborate with the wider infrastructure team (cloud, networking, Windows) to improve overall capability.
* Work closely with the product & innovation teams to optimise incremental delivery of software in test and production environments.
* Promote automation, documentation, and secure-by-default practices.
* Support escalations on a best-effort basis during critical incidents.
Key Requirements
* 5+ years in DevOps/SRE/Platform Engineering roles with deep Azure experience.
* Strong Kubernetes (AKS) operational expertise (networking, scaling, RBAC, Helm workload security, troubleshooting).
* Advanced knowledge of Terraform/Terragrunt and IaC module design.
* Solid grasp of CI/CD practices and experience with GitOps (ArgoCD) in production.
* Hands-on experience with observability tooling: Prometheus, Grafana, Loki, Tempo, OpenTelemetry.
* Practical experience with Ansible for configuration management and automation (Windows and/or Linux).
* Good Linux knowledge and Bash scripting skills.
Nice to Have
* Experience with Windows Server automation via Ansible/WinRM or DSC.
* Familiarity with container registries, image pipelines, and vulnerability scanning.
* Comfortable writing automation in Go.
What You Can Expect
* A technically challenging role with a genuine opportunity to influence on platform direction.
* A collaborative, geographically dispersed environment with a mixture of networking, Windows and cloud architecture backgrounds.
* Remote working.
How to Apply
If this sounds like an interesting opportunity and you’d like to find out more about the role, please apply by providing a cover letter along with a recent copy of your CV.
We’re currently focusing on direct applications, so we kindly ask agencies not to get in touch.