MongoDB Site Reliability Engineer £50,000 to 64,000 GBP bonus Hybrid WORKING Location: Manchester, North West - United Kingdom Type: Permanent MongoDB Site Reliability Engineer Location: Greater Manchester Contract: Permanent Overview An opportunity has arisen for a MongoDB Site Reliability Engineer to join a high-performing infrastructure and engineering function. You will be responsible for ensuring the stability, performance, and resilience of large-scale, business-critical systems. This role is ideal for an engineer who thrives in complex environments, enjoys solving multifaceted infrastructure and reliability challenges, and is passionate about automation, observability, and continuous improvement. Key Responsibilities Service Reliability & Support Provide advanced technical support to maintain and enhance the stability of critical systems. Execute preventative maintenance activities and leverage monitoring tools to detect, address, and prevent incidents. Analyse logs, metrics, and user reports to identify root causes and deliver timely resolutions. Automation & Efficiency Develop automation solutions to reduce manual effort and optimise system operations. Enhance monitoring, alerting thresholds, and observability to ensure issues are detected and acted on quickly. Operational Excellence Maintain detailed documentation and contribute to knowledge repositories for future reference and improved self-service. Support capacity management, business continuity, and performance tuning across MongoDB environments. Identify risks and recommend improvements to strengthen reliability and operational controls. Stakeholder Collaboration Work with cross-functional teams to deliver high-quality support and improvements. Provide input into policy development, process enhancements, and technical direction. Communicate technical issues and solutions clearly to varied audiences, including senior stakeholders. Required Skills & Experience Essential Experience in Site Reliability Engineering, DevOps, or MongoDB administration within complex environments. Strong MongoDB knowledge, including replica sets, sharding, backups, shell usage, and performance tuning. Practical experience writing automation scripts in Python or Bash. Highly Valued Experience with Percona, ClusterControl, CI/CD pipelines, and configuration/automation tools (Ansible, Chef). Monitoring experience using Prometheus, Grafana, or ELK stack. Exposure to Kubernetes or containerised environments. Understanding of API development (FastAPI) and scalable, high-performance system design. Role Purpose To ensure the effective monitoring, maintenance, and reliability of core technology platforms. The role focuses on reducing operational risk, improving system resilience, and delivering high-quality support for critical services. Leadership & Contribution Expectations Depending on level, responsibilities may include: Advising leadership on risk, controls, and operational improvement. Leading or guiding teams through complex technical tasks and collaborative assignments. Developing documentation, policies, and best practices to enhance governance and operational effectiveness. Supporting strategic initiatives through detailed analysis and cross-functional coordination. Reference: AMC-AQU-MDBB Postcode: Manchester (M1) adqu