Role: Principal Site Reliability Engineer (Platform / DevOps)
Location: Wokingham (Reading) - Hybrid (60% remote / 40% onsite)
Duration: 6 Months+
Rate: £500-£520 per day
Clearance: Active SC Clearance (mandatory)
Overview
We are seeking an experienced Principal SRE / Platform Engineer to lead platform-first initiatives focused on scalability, reliability, and performance across distributed systems. This role requires strong DevOps expertise and the ability to design and maintain resilient cloud-based infrastructure.
Key Responsibilities
Lead platform-first engineering initiatives to enhance scalability and reliability
Design, build, and maintain resilient infrastructure for distributed systems
Implement monitoring and alerting solutions to ensure high availability
Collaborate with engineering teams to improve system reliability and mitigate risks
Develop and maintain CI/CD pipelines to support efficient deployments
Recommend ongoing improvements to platform architecture and processes
Ensure compliance with security, governance, and regulatory standards
Required Skills & Experience
Strong background in software engineering for large-scale distributed systems
Proficiency in Golang, Java, or Python
Hands-on experience with AWS, Azure, or GCP
Deep knowledge of Kubernetes and container orchestration
Proven experience with CI/CD and infrastructure automation
Excellent troubleshooting and communication skills