Job Description
An exceptional opportunity to join an innovative, high-growth organisation shaping the future of AI-powered automation and digital interaction.
We’re seeking a Machine Learning Engineer with full-stack development experience to work on cutting-edge projects involving Generative AI, Retrieval-Augmented Generation (RAG), and multi-agent reasoning frameworks.
This is a hands-on, end-to-end engineering role with impact across the full ML lifecycle – from experimentation to deployment.
* Conversational AI & Reasoning: Design, fine-tune, and deploy advanced LLMs with agentic capabilities
* RAG Pipelines: Build and optimise scalable pipelines for structured and unstructured data retrieval
* LLM Training & Fine-Tuning: Use methods like LoRA, QLoRA, SFT, PEFT, and RLHF
* Inference & Acceleration: Serve models using vLLM, DeepSpeed, Triton, TensorRT
* Multi-Agent Orchestration: Work with LangChain, AutoGen, CrewAI, DSPy and similar tools
* Cloud & MLOps (AWS): Deploy with SageMaker, Bedrock, Lambda, S3, ECS, EKS
* Full-Stack Integration: Build APIs (FastAPI, Flask) and integrate with React, TypeScript, Node.js
* Vector Search: Use FAISS, Weaviate, Pinecone, ChromaDB, OpenSearch
Required skills & experience:
* 3–5+ years of experience in ML engineering and software development
* Deep Python proficiency, with PyTorch, TensorFlow or Hugging Face
* Proven experience with LLMs, RAG, and deploying cloud-native AI on AWS
* Strong full-stack skills (React, TypeScript, Node.js) and API development
* Familiarity with vector databases and multi-agent frameworks
Apply now to join this high growth and award-winning organisation with the opportunity to be part of building the future of AI driven projects and solutions. The role offers a highly competitive salary and benefits package and will be office based in Leicestershire.