Site Reliability Engineer (AWS)
Belfast - Hybrid
Full-time
Ocho is proud to partner with an exclusive client to recruit a skilled Site Reliability Engineer (SRE) with deep AWS expertise. This is a fantastic opportunity to join a growing global software organisation that powers mission-critical services across government and industry.
As a key member of the engineering team, you'll play a vital role in ensuring the reliability, availability, and performance of complex cloud-based infrastructure in a 24/7 production environment.
Key Responsibilities:
* Build and manage secure, highly available AWS infrastructure.
* Automate infrastructure deployment using Terraform, CloudFormation, or Ansible.
* Implement and maintain monitoring and alerting systems with tools like CloudWatch, Prometheus, and Grafana.
* Develop CI/CD pipelines using GitHub Actions, GitLab CI, or Jenkins.
* Respond to incidents, troubleshoot, and perform root cause analysis.
* Collaborate closely with development, DevOps, and security teams.
Experience:
* 3+ years in an SRE, DevOps, or related role.
* Hands-on experience with AWS (EC2, RDS, S3, EKS, etc.).
* Skilled in infrastructure as code and scripting (Python, Bash, Go).
* Experience with Docker, Kubernetes, and modern CI/CD workflows.
* Strong problem-solving and communication skills.
* Comfortable working in fast-paced, 24/7 production environments.
Additional Benefits:
* Private Healthcare (BUPA)
* Company Share Scheme - Buy one share, get one free
* Generous Parental Leave - 26 weeks full maternity pay
* Flexible Working Options - 1 day a week onsite (Belfast)
* Income Protection & Life Assurance - Up to 4x salary
If you meet the above criteria, please apply now. Alternatively, feel free to reach out to Andrew Harrison directly for a confidential chat.