Role: Site Reliability Engineer
Location: Hove, UK (Hybrid)
Contract: 12-Month Fixed Term Contract (PAYE)
Salary: £85,000 per annum
Start: Immediate starters preferred
Overview
We are seeking an experienced Site Reliability Engineer (SRE) to drive modernisation of IT operations through observability, automation, and reliability engineering practices. This role will focus on improving system scalability, reducing operational toil, and embedding a culture of automation and continuous improvement.
Insurance or financial services background preferred.
AWS and Azure experience essential.
Key Responsibilities
* Implement SRE practices to improve reliability, scalability, and operational efficiency
* Design and deploy observability platforms for monitoring and performance insights
* Drive automation and toil reduction initiatives
* Define and manage SLOs, SLIs, and error budgets
* Lead incident management, root cause analysis, and continuous improvement
* Deliver AI-driven alerting and proactive anomaly detection strategies
* Collaborate with engineering and product teams to enable resilient platforms
* Promote automation, self-healing systems, and shift-left engineering practices
Key Skills & Experience
* Strong hands-on SRE experience in enterprise environments
* Observability tools: Dynatrace and Datadog
* Automation & scripting: Python and Ansible
* Cloud platforms: AWS & Azure (mandatory)
* Containers & orchestration: Docker, Kubernetes
* CI/CD and cloud-native distributed systems experience
* Understanding of AIOps and automation-driven operations desirable
Preferred Background
* 12+ years' experience in SRE, DevOps, or IT Operations roles
* Experience delivering observability and automation at scale
* Relevant cloud or SRE certifications advantageous