Job Title: Site Reliability Engineer Department: IT Contract: Permanent Hours: 37.5 Reporting to: Chief Technology Officer Location: Southampton/ Hybrid About Us GFS is a leader in the creation of high-tech solutions for the eCommerce and Logistics market space. We provide a multi-carrier solution to small and large businesses alike, providing a single point of contact and a simple Customer focused delivery solution. We provide accurate and up to date tracking of all parcels, manage all questions and queries and seamlessly handle claims for missing and damaged goods. Our solution has helped many retailers reach customers across the UK and worldwide that were previously out of their reach. We are currently working on an exciting major platform upgrade and are seeking an experience Site Reliability Engineer to join our UK development office to establish an SRE function. About the role The successful applicant will become a part of our experienced development team, responsible for the continued design, development and operation of our innovative, market leading software. The role will be hybrid-based, with three days a week in our new Southampton office to help you get onboard, and foster collaboration within the team. Responsibilities include: Assist with the design and development of our carrier management platform, focusing on ensuring reliability, availability and performance objectives are considered. Define, establish SLI’s for critical systems, and report on SLO’s. Champion our use of DataDog as an observability and monitoring platform; establish key metrics for services and work with crossfunctional teams to creat dashboards and alerts. Conduct incident post mortems, identify root causes and take responsibility for the implementation of preventive measures to avoid future incidents. Champion SRE practices across the development function. This is a hands-on role for an experienced engineer who can act as a champion for SRE practices within a growing development team. About You We are obsessed with technology here at GFS and it shows in the ambition of what we have built; you'll share our ambition as we work with innovative technologies and understand how best to deploy them to implement our vision. You will be experienced in working with in a multi skilled team, collaborating with product owners to define and understand requirements, working with the developers and the CTO to design and develop technical solutions, and sharing your experience with the wider team to help the grow and gain experience. Required Skills Reactive Systems/Event-driven microservices patterns Docker and Compose Kubernetes, especially Google GKE Continuous Integration Exposure to a Continuous Delivery or Continuous Deployment environment Azure DevOps preferred Understanding of techniques such as Dark Launching, Feature Flagging etc. Excellent communication skills Experience of working in a distributed/remote team Experience with observability platforms such as DataDog Useful Skills and Experience Cloud computing (Google Cloud Platform or Azure preferred) Terraform Observability platforms DataDog an particular benefit Incident management solutions Some knowledge of GitOps principles