Glasgow (Onsite – 5 Days a Week)
Contract – 12 Months (Inside IR35)
The Opportunity
We’re hiring an experienced Cloud Platform Engineer (AWS SRE – Observability) to join a high-performing Software Engineering team delivering for a leading global financial services client.
You’ll play a key role in improving platform reliability, monitoring, and observability across AWS-based systems, working on critical, large-scale environments.
What You’ll Be Doing
* Design, build, and maintain Grafana dashboards for real-time monitoring and insights
* Develop and enhance observability solutions across AWS infrastructure
* Define and manage SLIs, SLOs, and SLAs to measure service performance
* Implement monitoring strategies based on golden signals (latency, traffic, errors, saturation)
* Support incident management, root cause analysis, and continuous improvement
* Collaborate with engineering and operations teams to improve system resilience
* Contribute to error budget management and reliability-focused decisions
* Integrate observability into CI/CD pipelines and cloud operations
What We’re Looking For
* 6+ years’ experience in SRE / Cloud Engineering / Platform Engineering
* Strong hands-on experience with AWS
* Proven expertise in Grafana and observability tooling
* Solid understanding of:
o Monitoring, logging, and alerting
o Strong communication and stakeholder collaboration skills
Nice to Have
* Experience with Snowflake or Databricks
* Knowledge of Infrastructure as Code (e.g. Terraform)
* Exposure to modern cloud-native and DevOps practices
#J-18808-Ljbffr