Our client is seeking Principle Site Reliability Engineers for an initial six-month contract, with a hybrid work model: a few days onsite in Wokingham and the rest remotely. This role is inside IR35 and requires active SC clearance.
Key Responsibilities
1. Lead platform-first initiatives to enhance scalability, reliability, and performance.
2. Design, build, and maintain resilient infrastructure for distributed systems.
3. Implement monitoring and alerting systems to ensure high availability and performance.
4. Collaborate with engineering teams to improve system reliability and mitigate risks.
5. Develop and maintain CI/CD pipelines for seamless deployment and release management.
6. Evaluate and recommend improvements to platform infrastructure and processes.
7. Ensure compliance with security standards, governance policies, and regulatory requirements.
Required Skills & Experience
* Proven expertise in software development and engineering for large-scale distributed systems.
* Strong proficiency in programming languages such as Golang, Java, or Python.
* Extensive experience with cloud infrastructure providers (AWS, Azure, or GCP).
* Deep knowledge of container orchestration platforms like Kubernetes.
* Exceptional problem-solving skills and a passion for building scalable, secure solutions.
* Excellent communication skills for cross-functional collaboration.
Candidates with prior high-level security clearance are especially encouraged to apply, as clearance can take up to 10 weeks. Successful applicants will need to be security cleared before appointment.
LA International is a HMG-approved ICT Recruitment and Project Solutions Consultancy, operating globally. We welcome applications from diverse backgrounds and experiences.
LA International has received the Queen’s Award for Enterprise: International Trade for the second consecutive period.
#J-18808-Ljbffr