Our client is one of the UK's leading ecommerce brands and, with flexible hybrid working in London, are looking for an SRE Engineering Manager.
Their Platform and Reliability teams are responsible for all services underpinning their websites and you'll drive operational excellence, observability and reliability at scale, and own the incident management processes and tools.
This position combines leadership, full stack reliability engineering and service management.
You'll need strong technical experience in reliability engineering, monitoring, alerting and observability combined with strong customer empathy and communication skills.
Requirements
* Proven experience in site reliability engineering management, observability, monitoring etc
* Good understanding of reliability in distributed software microservices and cloud-based environments.
* Experience implementing and running modern SRE tooling
* Experience improving operational processes and developing documented procedures
* Leadership, team management, collaboration and communication skills
#J-18808-Ljbffr