 
        
        Multi-Agent LLM Systems
Remote (MUST BE IN EUROPE) or Hybrid London/Barcelona
We’re partnering with a venture-backed startup led by a founder who has built and taken two technology companies to IPO, now assembling a world-class team to tackle one of the most impactful problems in applied AI.
The company is developing a voice-enabled AI copilot used by professionals to eliminate the friction from documentation and decision-making, a product with genuine, real-world impact that’s already being used in production environments.
They’re now looking for a Senior/Staff AI Engineer to own and evolve the core “brain” service behind this assistant, the system that powers reasoning, retrieval, and dialogue in real time.
Interview Process:
1️⃣ Intro call where we talk about the role.
2️⃣ Technical discussion with the Head of AI.
3️⃣ Deep-dive session with a Backend Engineer and ML Engineer from the team.
4️⃣ 30-minute conversation with the Founder.
Why This Is Worth Your Time
 * Real ownership: You’ll be the architect behind a core AI system, not a feature contributor.
 * Fast-moving environment
 * Immediate impact: Your code will run in production and support real users from day one.
 * Technical depth: Multi-agent reasoning, voice-streaming, RAG optimisation and all in one system.
 * Flexible setup: Remote across the EU, with optional co-working in London or Barcelona.
What you’ll do
 * Obsessive about latency, you think in milliseconds, optimise for concurrency, and understand the trade-offs between speed, cost, and model performance.
 * Design, implement, and productionise multi-agent LLM systems that reason, plan, and coordinate.
 * Develop FastAPI-based microservices optimised for low latency and high reliability.
 * Engineer and evaluate RAG pipelines: hybrid retrieval, re-ranking, grounding, and context validation.
 * Integrate real-time voice interfaces (STT/TTS, WebRTC, LiveKit) into intelligent conversational flows.
 * Instrument and evaluate system performance using observability and model-faithfulness metrics.
What we’re looking for
 * Proven ability to build and ship agentic or multi-agent frameworks into production.
 * Expert Python, FastAPI, and asyncio developer.
 * Practical experience with LangChain, Autogen, or custom orchestration layers.
 * Startup mindset: ownership, speed, and pragmatism over perfection.
Bonus points
 * Experience working with voice or streaming systems (STT/TTS, WebRTC, LiveKit).
 * Exposure to evaluation tooling, LLM-as-judge setups, or agent benchmarking.
 * Background in healthtech, fintech, or other compliance-heavy sectors.