Overview
Senior Site Reliability Engineer required to join a client on an initial three-month contract with strong scope for extension. Hybrid working: on site in Wokingham twice a week and remote otherwise. The role is inside IR35 and requires active SC clearance.
Responsibilities
* Collaborate with Agile teams to automate deployment, monitoring, and infrastructure management.
* Ensure platform and business application reliability and performance against strict SLAs and KPIs.
* Implement and maintain cloud-native observability stacks (Prometheus, Grafana, Loki, Tempo).
* Develop and maintain Infrastructure as Code (IaC) using tools such as Kustomize or Helm.
* Manage CI/CD pipelines using Tekton and ArgoCD.
* Support and troubleshoot OpenShift Operators (ServiceMesh, ODF, ACS, ACM, AMQ).
* Conduct security reviews and implement controls aligned with national infrastructure standards.
* Mentor junior engineers and promote SRE best practices.
* Collaborate with vendors and IT teams for incident resolution and platform improvements.
Required Skills
* Strong communication skills (written and verbal).
* Experience in remote team collaboration.
* Deep expertise in OpenShift/Kubernetes and Red Hat Linux.
* Proficiency in scripting (Bash, Python) and templating (Helm, Kustomize).
* Experience with CI/CD automation and IaC strategies.
* Security-first mindset with experience in regulated environments.
* Experience with VMware vSphere virtualization.