Principal Site Reliability Engineer (Platform / DevOps)
Location: Wokingham (Reading) - Hybrid (60% remote / 40% onsite)
Duration: 6 Months+
Rate: £500-£520 per day
Clearance: Active SC Clearance required (mandatory)
Overview
We are seeking an experienced Principal SRE / Platform Engineer to lead platform-first initiatives focused on scalability, reliability, and performance across distributed systems. This role requires strong DevOps expertise and the ability to design and maintain resilient cloud-based infrastructure.
Key Responsibilities
* Lead platform-first engineering initiatives to enhance scalability and reliability
* Design, build, and maintain resilient infrastructure for distributed systems
* Implement monitoring and alerting solutions to ensure high availability
* Collaborate with engineering teams to improve system reliability and mitigate risks
* Develop and maintain CI/CD pipelines to support efficient deployments
* Recommend ongoing improvements to platform architecture and processes
* Ensure compliance with security, governance, and regulatory standards
Required Skills & Experience
* Strong background in software engineering for large-scale distributed systems
* Proficiency in Golang, Java, or Python
* Hands-on experience with AWS, Azure, or GCP
* Deep knowledge of Kubernetes and container orchestration
* Proven experience with CI/CD and infrastructure automation
* Excellent troubleshooting and communication skills
#J-18808-Ljbffr