Role Overview
As a Cloud Operations Engineer, you will be the primary line of defence and support for high-stakes, secure cloud environments. You will bridge the gap between complex infrastructure and user success, ensuring the stability of critical UK workloads. This role is ideal for a technical problem-solver who thrives in high-security, air-gapped environments and is passionate about Infrastructure as Code (IaC) and continuous service improvement.
Key Responsibilities
In this role, you will be expected to:
* Frontline Technical Support: Act as the first point of contact for secure cloud users, troubleshooting and resolving critical technical issues to maintain mission-critical operations.
* Incident & Request Management: Manage support tickets end to end, ensuring every query is documented with a precise diagnosis, resolution steps, or appropriate escalation.
* High-Priority Incident Response: Execute documented runbooks and procedures for high-priority incidents, particularly those affecting workloads that may form part of UK critical national infrastructure.
* SME Development: Build deep, specialized knowledge in cutting-edge air-gapped cloud technology to provide expert "how-to" guidance to the user community.
* Operational Excellence: Participate in internal reviews to identify "toil" and manual bottlenecks, developing strategies for automation and continuous service improvement.
Essential Skills & Experience
The successful candidate must demonstrate:
* Linux Mastery: Extensive experience working with computer systems and networks, with a deep functional command of the Linux operating system.
* Production Support: A proven track record in a production support or live operations role, maintaining high-availability services for end-users.
* Cloud Native Tooling: Hands-on experience with Kubernetes and Infrastructure as Code (IaC) tools, specifically Terraform.
* Network Troubleshooting: Ability to deconstruct complex network architectures and resolve connectivity issues using standard Linux networking tools.
* Analytical Problem Solving: Strong experience in troubleshooting multifaceted technical issues and identifying underlying bugs or system failures.
* Growth Mindset: A proactive willingness to learn and master deep technical skills within specialized air-gapped cloud architectures.
Critical Requirements
* Security Focus: Experience or comfort working within highly regulated or secure environments.
* Process Discipline: Ability to follow strict operational procedures while maintaining the agility to suggest improvements to existing runbooks.