Role: Tech Lead, Site Reliability Engineer
Location: London (Hybrid – 1 day per week in office)
We are working with a mission-led technology organisation that continues to invest heavily in its cloud platform and infrastructure capabilities. As part of this growth, they are looking for a Tech Lead SRE to help shape the platform's reliability, scalability, and operational maturity.
This is a technical leadership role rather than a people management role. You will serve as the senior technical voice within the SRE function, guiding engineering standards and reliability practices across the platform while remaining hands‑on.
You will work closely with platform engineers, cloud engineers and product teams to ensure services are reliable, scalable and observable as the platform continues to grow.
Key Responsibilities
* Acting as the technical lead within the SRE function
* Improving platform reliability, monitoring and observability
* Working closely with engineering teams to design resilient systems
* Driving automation, CI/CD and infrastructure improvements
* Supporting incident response and root cause analysis
* Helping define SRE best practices and reliability standards
Tech Environment
* Cloud platforms (AWS, GCP or Azure)
* Kubernetes and containerised workloads
* Terraform or Infrastructure as Code
* Prometheus, Grafana or modern observability tooling
* CI/CD pipelines and automation
* Python, Go or similar scripting languages
About You
* Strong background in SRE, DevOps or Cloud Infrastructure
* Experience running production systems at scale
* Strong understanding of monitoring, reliability and automation
* Comfortable acting as a technical leader within engineering teams
#J-18808-Ljbffr