Senior Site Reliability Engineer (UK or Germany) – Fully Remote – £120k + Benefits
We are hiring experienced Senior Site Reliability Engineers to join a global engineering team supporting a high‑availability, Java‑based platform used by customers worldwide.
This is a permanent, fully remote role open to candidates based in the UK or Germany, offering a competitive package of ~£120k + benefits.
If you are a true SRE (not DevOps-focused) who cares deeply about reliability, stability, incident response, and performance at scale, we want to speak with you.
What You’ll Do
Ensure high availability, scalability, reliability, and security across production environments
Lead live incident response, drive root‑cause analysis, and deliver lasting solutions
Build and maintain SLIs, SLOs, and SLAs
Support a core Java product: patching, SDKs, configuration (YAML), and uptime work
Drive automation using Python, Linux tooling, and IaC
Work closely with security, compliance, and multiple engineering teams
Participate in a 24/7 on‑call rotation (1 week every 4–5 weeks)
Tech Stack & Skills
AWS: EC2, EKS, Load Balancers, VPC — with hands‑on production experience
Linux: Deep troubleshooting & sysadmin fundamentals
Python: Scripting for automation
SRE mindset: Incident management, observability, reliability engineering principle
We’re Looking For
Senior‑level SREs with proven experience running large‑scale, mission‑critical systems
Engineers who love digging into incidents, solving problems properly, and improving systems over time
Professionals who thrive in autonomous, globally distributed teams.