*OUTSIDE IR35, MARKET RATES, MUTLIPLE ROLES, SC REQUIRED*
In this role, you will:
* Design, build, and oversee production AI systems end to end, from API to infrastructure.
* Work across LLMs, classical ML, retrieval systems, rules-based logic, and cloud-native services.
* Make pragmatic architectural decisions that balance performance, reliability, and cost.
* Decide when AI adds value and when simpler or more traditional approaches are the better solution.
* Communicate trade-offs clearly to both technical and non-technical stakeholders.
LLMs are an important part of the work, but not the whole story. You will design solutions that combine multiple techniques and technologies, primarily on AWS and Azure, with a strong focus on long-term operability, observability, and cost control.
Typical Key Responsibilities
* Architect and oversee production-grade, full-stack AI systems.
* Design solutions combining LLMs, RAG, semantic search, classical ML, and rules-based components.
* Build and optimise retrieval and RAG pipelines, including vector search and indexing strategies.
* Deploy and operate self-hosted and managed models on CPU and GPU infrastructure, using tools such as Ollama, vLLM, SGLang, and AWS or Azure managed services.
* Design and build scalable Back End services and APIs, primarily in Python, with JavaScript or TypeScript as a plus.
* Leverage cloud-native AI and platform services,...