Location: Glasgow (Hybrid – 2 days per week onsite)
We are looking for a passionate and experienced Site Reliability Engineer (SRE) to help drive reliability, scalability, and operational excellence across our platforms. This role is ideal for someone who enjoys working with complex systems, applying engineering rigour to operations, and influencing reliability practices across teams.
What you’ll be doing
* Champion and embed SRE best practices, helping to mature reliability capabilities across multiple stakeholder groups.
* Perform advanced fault analysis and troubleshooting using the scientific method—forming hypotheses, gathering evidence, validating assumptions, and drawing clear conclusions.
* Use data and insights to drive continuous improvement, with a strong focus on measurement, experimentation, and actionable outcomes.
* Facilitate technical discussions and lead blameless post‑incident reviews, ensuring learnings are shared and improvements are implemented.
* Clearly explain complex systems and behaviours to both technical and non‑technical audiences through documentation, diagrams, and narrative.
* Research, evaluate, and actively experiment with new tools and technologies to improve predictability, observability, and performance.
* Share knowledge, mentor others, and actively contribute to a culture of learning and improvement.
What we’re looking for
To be successful in this role, you should have:
* Proven hands‑on experience with Site Reliability Engineering practices, including strong programming skills and the ability to influence SRE maturity across teams.
* Excellent problem‑solving skills, with a structured and analytical approach to fault diagnosis and incident resolution.
* A passion for continuous improvement, driven by data, metrics, and a belief that measurement is essential to progress.
* Strong communication skills, with the ability to lead technical conversations, run retrospectives, and influence organisational direction.
* Curiosity and enthusiasm for understanding complex, socio‑technical systems and how people and technology interact.
* A genuine interest in lifelong learning and teaching others.
Highly valued (but not essential)
* Deep knowledge of systems engineering, including operating systems, networking, cloud platforms, automation tools, and Infrastructure as Code.
* Experience or strong interest in using AI to solve technology problems more efficiently and at scale.
* Hands‑on experience with observability tools and techniques, including instrumentation, metrics, logging, and tracing.
* Competitive salary of up to £80,000
* Hybrid working model – 2 days per week onsite in Manchester
* Work on meaningful, complex systems where reliability truly matters
* Influence engineering culture and operational practices across teams
* Continuous learning, experimentation, and professional growth encouraged
#J-18808-Ljbffr