Job Description
Halian Technology looking for a talented and driven Site Reliability Engineer (SRE) to join our growing technology team. In this role, you’ll ensure the reliability, scalability, and performance of our digital platforms that support memorable customer experiences across the hospitality sector. You’ll work alongside our engineering, product, and infrastructure teams to build high-availability systems and automated operations that support the future of digital hospitality.
Key Responsibilities:
* Drive system reliability, availability, and performance through engineering excellence.
* Design and implement monitoring, alerting, and observability tools using platforms like Datadog.
* Automate operational tasks using scripting, Infrastructure as Code (IaC), and configuration management tools.
* Troubleshoot incidents, lead root cause analysis, and improve Mean Time to Resolution (MTTR).
* Partner with software engineers to integrate reliability best practices into the development lifecycle.
* Build and maintain CI/CD pipelines to streamline deployments and rollbacks.
* Ensure infrastructure meets security and compliance standards.
* Optimise system resources for both performance and cost-effectiveness.
* Contribute to incident response and participate in on-call rotations.
* Track and improve key SRE metrics such as error rates, incident count, and monitoring coverage.
What You’ll Bring:
* 3+ years of experience in Site Reliability Engineering, DevOps, or equivalent roles.
* Strong skills in cloud-based infrastructure (Azure or AWS) using IaC practices.
* Hands-on experience building and managing CI/CD pipelines and developer tooling.
* Deep understanding of distributed systems and debugging complex technical issues.
* Proficient in observability platforms like Datadog or similar.
* Knowledge of security principles and integration of security into infrastructure design.
* Proven experience with event-driven architectures and building highly available (HA) and disaster recovery (DR) compliant systems.
* Strong grasp of software development standards and practices like TDD, BDD.
* Excellent collaboration and communication skills with a proactive and positive attitude.
* A Computer Science degree or equivalent experience.
* Certifications in Azure, AWS, or relevant platforms are a plus.
* An interest in AI and emerging technologies is welcome.
Apply now to join a forward-thinking technology team where reliability, innovation, and customer impact go hand in hand.