Position Description:
The Space, Defence and Intelligence business unit in CGI is a true IT Systems Integrator. We work, build, and operate bespoke, technically complex, mission-critical systems which help our clients keep us all safe and secure. We bring innovation to our clients using proven and emerging technologies, agile delivery processes and our deep expertise across the breadth of space, defence, intelligence, aerospace and maritime, all underpinned by our end-to-end cyber capability. We work collaboratively with global technology companies, cutting-edge SMEs and academia to deliver the optimal solution for each client.
CGI was recognised in the Sunday Times Best Places to Work List and has been named one of the ‘World’s Best Employers’ by Forbes magazine. We offer a competitive salary, excellent pension, private healthcare, plus a share scheme (3.5% + 3.5% matching) which makes you a CGI Partner not just an employee. We are committed to inclusivity, building a genuinely diverse community of tech talent and inspiring everyone to pursue careers in our sector, including our Armed Forces, and are proud to hold a Gold Award in recognition of our support of the Armed Forces Corporate Covenant. Join us and you’ll be part of an open, friendly community of experts. We’ll train and support you in taking your career wherever you want it to go.
*** Applicants must be sole UK nationals and already hold HMG HLC clearance ***
Role Location: Gloucester or Manchester
We are seeking highly skilled and motivated Site Reliability Engineers to join our team. The ideal candidates will have a good understanding of engineering principles and a broad understanding of full-stack software technologies, with hands-on expertise in application development and tooling within a secure/on-prem environment, combined with a passion for applying best practices.
Your future duties and responsibilities:
•Architect, Build & Operate Cloud Infrastructure
Design and deploy scalable, secure, and fault-tolerant cloud environments across AWS, Azure, or GCP, optimising for performance, availability, and cost efficiency.
•Enterprise Cloud Migrations
Lead migrations of legacy systems (e.g. lift-and-shift, re-architecture) to the cloud seamlessly and with minimal downtime.
•Automation & Infrastructure as Code (IaC)
Use Terraform, CloudFormation, Ansible, or similar tools to automate cloud resource provisioning, CI/CD pipeline deployments, and configuration management.
•Security & Compliance Oversight
Implement IAM, encryption, VPC/NSG policies and ensure compliance with standards (e.g. GDPR, ISO, SOC 2) across cloud environments.
•Monitoring, Optimisation & Cost Governance
Continuously monitor workloads using tools such as CloudWatch, Prometheus, and Datadog; drive performance tuning and cost optimisation (rightsizing, reserved instances, auto-scaling).
•Disaster Recovery & Business Continuity Planning
Develop and test backup/DR strategies, restore drills, and self-healing infrastructure to ensure reliability and uptime.
•Collaboration & Knowledge Sharing
Work closely with DevOps, development, security and operations teams; prepare architecture/design documents, network diagrams, runbooks and training materials.
Required qualifications to be successful in this role:
•Cloud Platforms: Hands-on experience with AWS, Azure, or Google Cloud Platform.
•Infrastructure Automation: Proficiency with Terraform, CloudFormation, Ansible or equivalent IaC tools.
•Containerisation & Orchestration: Experience deploying and managing Docker and Kubernetes clusters (EKS, AKS, GKE or on-prem).
•Programming / Scripting: Competent in Python, Bash, PowerShell or similar, for automation and tooling.
•Networking & Storage: Strong understanding of VPC architecture, subnets, firewalls, load balancers, and storage tiers.
•DevOps & CI/CD: Experience building pipelines with Jenkins, GitLab CI/CD, GitHub Actions or Azure DevOps.
•Security & Compliance: Experience implementing and monitoring IAM, encryption, audit logging, network isolation, and compliance frameworks.
•Monitoring & Optimisation Tools: Familiarity with CloudWatch, Grafana, Datadog, Prometheus, ELK or similar.
The position requires team members to work on the client site to ensure the reliability and availability of critical systems.
Skills:
1. DevOps
2. English
3. GitLab
4. Kubernetes
5. Cloud Native Development