DevOps / Site Reliability Engineer
Location: Remote (United Kingdom)
Contract: Inside IR35
About the Role
We’re looking for a hands-on DevOps Engineer or Site Reliability Engineer with a solid foundation in cloud infrastructure and software development, and a strong emphasis on TypeScript, GitHub, and Terraform. You’ll be joining a growing Cloud Engineering team dedicated to building scalable internal platforms and tools that empower DevOps teams across the organization.
You’ll be instrumental in designing and operating high-availability, multi-cloud systems, collaborating across security, strategy, and architecture teams to deliver robust infrastructure and automation.
Key Responsibilities
* Design, build, and manage internal cloud platforms (AWS and Azure) used by engineering teams
* Develop and maintain Infrastructure as Code solutions primarily using Terraform
* Build, deploy, and maintain CI/CD pipelines using GitHub Actions and other tooling
* Implement and support systems for secrets management, observability, and service meshes
* Collaborate with InfoSec, Architecture, and Product Strategy teams to ensure platform alignment with business and security needs
* Write and maintain TypeScript code, including custom GitHub Actions
* Troubleshoot and support production systems at scale
* Drive operational excellence and uphold best practices for security, scalability, and resilience
Core Skills & Experience
* 5+ years of experience in DevOps, Site Reliability, or Systems Administration
* 3+ years of experience with cloud computing (AWS and/or Azure in a production environment)
* Expertise in Terraform and a strong understanding of Infrastructure as Code principles
* Real-world experience developing TypeScript applications (both backend and automation scripts)
* Proficiency with GitHub workflows and GitHub Actions (especially custom actions written in TypeScript)
* Strong Unix/Linux system administration background
* Experience with high-traffic, high-availability web systems
* Scripting and automation using TypeScript, Go, Python, or Ruby
* Strong understanding of CI/CD practices, with experience in systems like GitHub Actions, GitLab CI, or Jenkins
* Experience with monitoring and observability tools like Prometheus, Grafana, and ELK
* Familiarity with secrets management and service mesh technologies
* NoSQL experience (e.g., Cassandra) is a plus
Cloud & DevOps Tooling
* AWS services, including API Gateway and AppSync (preferred)
* Azure cloud tools and networking fundamentals
* GitHub & GitHub Actions (must be proficient)
* IaC: Terraform (primary), Puppet, or CloudFormation
* Observability: Prometheus, Grafana, ELK, etc.
* CI/CD: GitHub Actions, Jenkins, GitLab CI
* Security tooling, firewalls, ACLs, and compliance workflows
* Collaboration with security/compliance teams (e.g., InfoSec)
Software Development Skills
* Strong hands-on development with TypeScript (backend and DevOps tooling)
* RESTful API and GraphQL API design and implementation
* OpenAPI specification knowledge
* Python experience is a plus