 
        
Senior Site Reliability Engineer at Venquis
Location: Remote / Hybrid – UK
Sector: E-Commerce & Retail Platforms
A global retail brand is scaling its e-commerce and digital customer platforms, handling millions of daily transactions and peak seasonal traffic. To support this growth, they are hiring a Senior Site Reliability Engineer with deep expertise in observability, cloud scalability, and performance tuning.
What you’ll do:
 * Build and maintain highly scalable cloud infrastructure for large-scale e-commerce platforms.
 * Develop monitoring and observability frameworks to ensure fast response to performance bottlenecks.
 * Optimise CDN, caching, and APIs for high-traffic shopping events (e.g., Black Friday).
 * Drive automation and CI/CD pipelines to accelerate feature delivery without compromising stability.
 * Partner with software engineering teams to ensure always-on shopping experiences.
What we’re looking for:
 * Proven track record in high-scale distributed systems (retail, e-commerce, digital platforms).
 * Expertise in observability stacks (Grafana, Prometheus, Datadog, New Relic, Elastic).
 * Strong cloud skills (AWS/GCP/Azure) including Kubernetes and serverless.
 * Solid coding skills for automation (Python, Go, JavaScript, Bash).
 * Experience optimising performance in high-traffic digital platforms.
This is your chance to build reliability at retail scale, where seconds of downtime mean millions in lost revenue.
Highly competitive salary plus a percentage bonus paid annually.
Venquis is acting as an Employment Agency in relation to this vacancy.
Seniority level: Mid-Senior level
Employment type: Full-time
Job function: Engineering and Information Technology