Requirements
* Experience building and shipping production applications
* Excellent communication and collaboration skills with a curiosity to solve problems
* Experience using cloud platforms such as AWS (preferred), Google Cloud, or Azure
* Experience creating and maintaining Kubernetes clusters in production, or a strong desire to do so
* Solid foundational understanding of distributed systems
* Experience with CI/CD, GitOps, and Infrastructure as Code technologies
* We understand some people may not apply for jobs unless they tick every box. If you're excited about joining us and think you have some of what we're looking for, even if you're not 100% sure, we'd still love to hear from you.
What the job involves
* Reporting to: Engineering Manager
* This role is based in the UK and requires an existing right to work in the UK.
* At this time, we are not able to offer visa sponsorship for this role.
* We are committed to building a diverse, global team and our sponsorship policy is evaluated on a role‑by‑role basis.
* You’ll play a pivotal role in building and evolving our internal developer platform to empower our software teams.
* Your work will impact how our developers use cloud infrastructure, help teams eliminate infrastructure toil, improve engineering productivity, reinforce security safeguards, and constantly strive for greater cost efficiencies in operations.
* Engineering Foundations: Design and maintain a scalable, reliable, and developer‑friendly platform to promote seamless software development, deployment, and operations.
* Automation & Tooling: Develop tools to abstract complex platform aspects from developers, improving developer experience and operational efficiency.
* Infrastructure as Code: Use tools such as Terraform, Helm, and Crossplane to manage and automate provisioning of cloud infrastructure, championing GitOps best practices.
* Monitoring & Observability: Establish robust monitoring and logging foundations in collaboration with the Site Reliability Engineering function.
* Security & Reliability: Incorporate security best practices, ensure compliance, and participate in the out‑of‑hours on‑call rota to troubleshoot and prevent recurrent issues.
* Stakeholder Coordination: Foster strong relationships with internal and external stakeholders to coordinate upgrades and hold service providers accountable.