📏 Size | ~450 globally, ~130 in engineering, ~40 in London
🎯 Areas | ML platform, inference infrastructure, backend development
📍 Based | Zone 1, Central London
đź’» Hybrid | 3 days a week in-office
Workonomics is partnering with an AI company you may already know through their suite of popular B2C app products. Now, they're going through a significant shift to becoming an enterprise B2B platform.
They’re behind a widely-adopted open‑source model and are now focused on the harder problem: turning cutting‑edge ML research into a reliable, scalable platform used by millions. They’ve a close partnership with NVIDIA using their latest GPUs and libraries.
They’re hiring an Engineering Lead in London for their ML Platform team.
You’ll be responsible for taking research prototypes and turning them into production‑grade inference systems, built in Python, running on GPUs, deployed via Kubernetes, and operating under real consumer traffic where latency, cost, and reliability all matter.
You’ll have influence over architecture, standards, and team direction.
What they’re looking for
* A track record of owning, building, and scaling complex platforms from design to production
* Backend development expertise focused on API design and observability / monitoring
* Interest in / exposure to ML model serving and / or ML platform tooling
* Comfort with cloud‑based distributed systems
* Hands‑on technical leadership experience
This is not a research role, and not a pure people‑management position. It’s for engineers who enjoy owning complex backend systems end‑to‑end and shaping how ambitious technology works in the real world.
If this sounds like you, apply to learn more about the company and role.
#J-18808-Ljbffr