Overview
We are looking for a Site Reliability Engineer who views "manual effort" as a bug to be fixed. In this role, you won't just be keeping the lights on; you will be the architect of our system's resilience. We need a proactive engineer who is obsessed with Kubernetes and Cloud infrastructure, but also has a visionary streak - someone eager to experiment with AI-driven operations (AIOps) to predict failures and automate responses. If you enjoy building self-healing systems and staying ahead of the tech curve, this is the place for you.
* Engineering Reliability: Designing and implementing self-healing infrastructure using Kubernetes to maintain high uptime and system integrity.
* Scaling Cloud Ecosystems: Optimizing our cloud footprint (AWS/GCP/Azure) to ensure our platforms can handle rapid growth without breaking a sweat.
* Innovating with AI: Proactively identifying opportunities to integrate AI tools into our observability stack to automate incident detection and root-cause analysis.
* Eliminating Toil: Writing clean, efficient code to automate repetitive operational tasks, turning manual workflows into seamless "set and forget" processes.
* Defining Observability: Building advanced monitoring and alerting frameworks that provide deep insights into system health and performance.
* Kubernetes Power User: Extensive experience managing production-grade K8s environments, including ingress, service mesh, and container security.
* Cloud Infrastructure Expert: A deep understanding of cloud networking, storage, and compute services within a major provider (AWS, Azure, or GCP).
* Proactive Mindset: An engineer who doesn't wait for a ticket; you naturally seek out system weaknesses and build solutions to strengthen them.
* AI Curiosity: An active interest in the AI landscape and a desire to leverage LLMs or machine learning to improve SRE workflows.
* Programming Literacy: Ideally experience with at least one language (such as Java, Python, Go, or Ruby) to bridge the gap between software engineering and operations.
* Matillion is the intelligent data integration platform.
About Matillion and Values
We’re changing how the world works with data - and we need driven, curious people who think big and move fast. We built the Data Productivity Cloud to supercharge data productivity, and now we’re shaping the future of data engineering with Maia - our AI-powered virtual data engineers that help teams design, build, and manage data pipelines at unmatched speed. Join #TeamGreen, where the mission comes first, collaboration drives us forward, and everyone pulls in the same direction to make a dent in the universe bigger than ourselves.
Benefits and Compensation
At Matillion, we are committed to providing competitive salaries in line with market standards. Our estimated compensation range for this position is £49,600 - £74,400, but the final salary will be based on your relevant skills, experience and qualifications demonstrated in the hiring process. We operate a flexible working culture that promotes work-life balance, with benefits including:
* Company Equity
* 30 days holiday + bank holidays
* 5 days paid volunteering leave
* Health insurance
* Life Insurance
* Pension
* Access to mental health support
Thousands of enterprises trust Matillion for a wide range of use cases from insights and operational analytics, to data science, machine learning and AI. We are a truly global workforce, dual headquartered in Manchester, UK and Denver, Colorado, with expanding offices in Hyderabad, India, along with valuable remote colleagues around the world.
#J-18808-Ljbffr