We are partnering with a leading organisation in the data and analytics space to recruit an experienced Senior Site Reliability Engineer. This is an opportunity to join a highly collaborative, technically strong SRE function working on large‑scale, cloud‑native platforms that support high‑volume, high‑speed data services.
The team is expanding due to increased workload, and this role will become the eighth member of an established, supportive engineering group. You’ll play a key part in driving cloud automation, improving system reliability, and supporting critical production environments.
Key Responsibilities
Build, maintain, and improve AWS cloud infrastructure
Develop automation using Terraform, Ansible, and Python
Support incident response and troubleshoot performance issues
Deliver routine maintenance, including patching and upgrades
Enhance CI/CD pipelines (GitLab CI, GitHub CI)
Contribute to Agile ceremonies and take ownership of user stories
Implement new technologies and solutions to improve system reliability
What You Will Bring
Strong commercial experience with AWS (essential)
Solid understanding of Linux systems (RHEL, CentOS or similar)
Scripting skills, ideally Python
Hands‑on experience with Terraform and/or Ansible
Proficiency with Docker
Exposure to CI/CD tooling and Agile ways of working
Background in software engineering, systems engineering, or previous SRE roles
Minimum 4 years’ experience in a relevant technical discipline
Please note, this role is not suitable for candidates with Windows‑only experience or Engineers without hands‑on AWS or Linux exposure.
Remote working is supported, with an on-site presence in Nottingham, ideally once per week preferred