Cloud Operations Engineer
Check all associated application documentation thoroughly before clicking on the apply button at the bottom of this description.
£30,000 to 50,000 GBP
Bonus
Onsite WORKING
Location: Cheltenham, Gloucester, South West - United Kingdom Type: Permanent
Cloud Operations Engineer / Lead Engineer (Multiple Roles)
Location: Cheltenham (Onsite, 5 days per week)
Levels: Multiple hires (junior to senior)
Eligibility: UK Citizen and eligible for SC clearance
Working Pattern: 24/7 shift-based operational environment
Package: Competitive salary depending on experience plus shift allowance
Overview
We are hiring multiple Cloud Operations Engineers and Lead Engineers to join a highly secure, mission-critical cloud operations team.
The role is open to a broad range of backgrounds, including Computer Science graduates, Linux-focused infrastructure engineers, Kubernetes/platform engineers, and individuals from live service or service desk environments with strong incident management experience.
This is a hands-on operational engineering role focused on maintaining stability, availability, and performance of a complex, secure cloud platform operating at scale.
Key Responsibilities
* Provide frontline operational support for secure cloud infrastructure and platform users
* Troubleshoot and resolve critical incidents across live production systems
* Lead or support incident response, escalation, and coordination during shifts
* Operate within a 24/7 rota supporting high-priority workloads and services
* Follow, maintain, and improve operational runbooks and incident procedures
* Identify opportunities to reduce operational toil and improve service reliability
* Support mentoring and knowledge sharing for junior engineers (senior roles)
* Engage with internal stakeholders and third parties during critical incidents
Technical Environment
* Linux (strong hands-on experience required)
* Kubernetes (deployment, troubleshooting, and platform support)
* Infrastructure as Code (Terraform or similar tools)
* Cloud-native networking and system troubleshooting
* Observability and monitoring tools
* APIs and integration services
* Secure, restricted, air-gapped cloud environments
Required Experience
* Strong experience working with Linux-based systems in production environments
* Background in live service support, infrastructure operations, or platform engineering
* Experience troubleshooting system, application, or network-level issues
xcswzye
* Exposure to Kubernetes and/or containerised environments
* Understanding of infrastructure, networking, and operational support principles
* Ability to operate in high-pressure, incident-driven environments
* Willingness to learn and operate within highly secure cloud architectures
Desirable Experience
* Kubernetes administration or advanced troubleshooting experience
* Infrastructure as Code experience (Terraform or similar)
* Exposure to observability and monitoring platforms
* Experience working in 24/7 operational environments
* Prior experience coordinating shifts or leading small technical teams
deep expertise in secure cloud operations, Kubernetes platforms, and large-scale infrastructure engineering.
Reference: AMC-AQU-COEC
Postcode: GL52
#adqu