 
        
        Senior Site Reliability Engineer - DBT - G7
Location: Belfast, Birmingham, Cardiff, Darlington, Edinburgh, London, Salford
This role is part of the Department for Business and Trade (DBT) Digital, Data and Technology (DDaT) directorate.
About the Role
We are building a cutting‑edge developer platform in AWS to support DBT services.
As a Senior Site Reliability Engineer you will provide the tools and practices that enable development teams to deliver reliable, performant services.
Job Description
 * Develop and maintain observability tooling to enhance system monitoring and incident response.
 * Streamline deployment processes to reduce downtime and speed up feature delivery.
 * Partner with product teams to design and enforce service‑level indicators, objectives, and error budgets.
 * Participate in on‑call rota and mentor junior colleagues across DDaT.
Technology Stack
 * Amazon Web Services (AWS) – CodeBuild, CodePipeline, Copilot, CloudFormation
 * Azure
 * Terraform / Pulumi
 * Docker, ECS, ECR, OpenSearch
 * Python / Django
 * PostgreSQL (Amazon RDS)
 * Sentry, Redis / Elasticache
Essential Qualifications
 * Cloud experience with AWS, Azure or Google Cloud.
 * Experience building code‑defined, reliable infrastructure (Terraform, CloudFormation, Pulumi).
 * Fluent in at least one programming language; clean, effective code.
 * Design, analyse and troubleshoot distributed systems.
 * Knowledge of Linux/Unix fundamentals and TCP/IP networking.
 * Strong communication skills for technical and non‑technical stakeholders.
Desirable Qualifications
 * Defining and measuring Service Level Objectives through observability.
 * Prototyping using existing open‑source components.
#J-18808-Ljbffr