About Us
Amelco Ltd are a leading gaming and gambling solution software provider with a strong presence in the USA, UK, and Europe. Through partnerships with global gaming companies, we build cutting-edge technical platforms across sportsbooks, lottery, casino, virtual gaming, and financial trading.
Our vision is to shape the future of gaming by transforming operations into intelligent, data-driven solutions that deliver exceptional customer experiences and create sustainable value for all stakeholders.
We believe in teamwork, knowledge sharing, and transparency with accountability.
The Role
* We’re looking for a Site Reliability Engineer (SRE) to help shape and drive how we build and operate reliable, observable, and cost-efficient systems.
* You’ll work closely with our development, platform, and incident management teams to define what “reliable” means in measurable terms — and build the tooling and processes to achieve it.
* Your work will directly influence the speed, stability, and scalability of our platform.
Key Responsibilities
* Partner with development teams to define and manage SLOs/SLIs, and use error budgets to guide engineering decisions.
* Enhance observability — ensuring metrics, logs, and tracing are in place to detect and fix issues proactively.
* Lead cost optimisation initiatives: monitor spend, rightsize workloads, tune autoscaling, and drive efficient infrastructure usage.
* Strengthen production readiness with pre-deployment checks, post-release validation, and robust platform guardrails.
* Introduce and run chaos engineering experiments to improve system resilience.
* Automate operational processes to reduce manual intervention across the stack.
* Contribute to major incident response, providing engineering expertise.
* Collaborate cross-functionally to raise the bar on platform stability, security, and performance.
Required Skills & Experience
* 3+ years in SRE, Platform, or DevOps roles.
* Strong operational experience with Kubernetes (on-prem and AWS EKS).
* Proven track record defining and working with SLOs/SLIs in production environments.
* Deep understanding of observability (metrics, logging, tracing, telemetry).
* Experience with Terraform and GitOps workflows.
* Proficient scripting skills in Python, Bash, or Go.
* Demonstrated ability to balance cost efficiency and system reliability.
* Excellent communication skills and ability to work across multiple teams.
* Experience running chaos engineering experiments.
* Exposure to high-throughput, low-latency systems.
* Experience with FinOps or cost management practices.
* AWS Certifications (e.g., Solutions Architect, DevOps Engineer).
#J-18808-Ljbffr