We're building an internal AI platform from scratch: the tooling that will define how Critical Cloud operates as we scale across Europe. This isn't a rotation or a shadow programme. From week one, you'll be shipping real tooling and operating real production environments for real customers. The two tracks exist because they make each other better. That's the design.
From week one, you'll contribute to both tracks: shipping AI tooling that helps us run cloud operations better, and operating real production infrastructure for real customers. Two disciplines, one engineer, no silos.
Critical Cloud is the world's first "Powered by Datadog" accredited MSP, a Datadog-native cloud MSP built for European tech‑led SMBs. We're building an internal AI platform (the Critical Cloud Platform) to automate and augment how we operate customer environments. This role sits at the centre of that programme.
Half your time will be engineering AI‑assisted tooling: LLM integrations, agents, and automation workflows that reduce toil and improve our operational quality. The other half will be hands‑on SRE work: monitoring, incident support, infrastructure‑as‑code, and customer‑facing operations. Each half makes you better at the other.
Claude / Anthropic API – Primary LLM platform
Datadog – Core observability platform
AWS – Primary cloud, multi‑account
Azure – Secondary cloud workloads
Terraform – Infrastructure as code
GitHub Actions – CI/CD pipelines
Year 1–2
Year 2–3: Engineer II – Specialise or Broaden
Year 3+: Senior / Lead – Platform or SRE
The ideal candidate doesn't have to choose between writing code and running infrastructure. They're curious about both and understand that the two inform each other. You'll build AI tooling that automates real operational problems precisely because you've experienced those problems hands‑on in the SRE track.
We operate to ISO 27001. Everything we build, including AI systems, has to be explainable, auditable, and consistent with our governance framework. If you care about building AI tools that are reliable, not just impressive demos, you'll fit right in.
This is an early career role, but we don't run it like one. You'll have genuine ownership, direct access to founders, and the chance to shape a platform that will define how Critical Cloud operates at scale.
When something breaks in a customer environment, you take it through to resolution and document it properly. Not "I raised a ticket." Not "I told the senior." You own it.
The AI tooling track exists because engineers asked "what if we automated that?" This role rewards people who look at repetitive manual work and immediately start thinking about whether they could build their way out of it.
The worst automation is the one nobody trusts because it's too complicated. Build for the on‑call engineer picking it up at 3am without context. A runbook anyone can follow is worth more than one only you understand.
You'll hit problems on both tracks where the answer isn't in a tutorial. The engineers who thrive here figure things out with what they have, in the time they have, to the standard required.