An exceptional opportunity to join an innovative, high-growth organisation shaping the future of AI-powered automation and digital interaction.
We're seeking a Machine Learning Engineer with full-stack development experience to work on cutting-edge projects involving Generative AI, Retrieval-Augmented Generation (RAG), and multi-agent reasoning frameworks.
This is a hands-on, end-to-end engineering role with impact across the full ML lifecycle - from experimentation to deployment.
Conversational AI & Reasoning:Design, fine-tune, and deploy advanced LLMs with agentic capabilities
RAG Pipelines:Build and optimise scalable pipelines for structured and unstructured data retrieval
LLM Training & Fine-Tuning:Use methods like LoRA, QLoRA, SFT, PEFT, and RLHF
Inference & Acceleration:Serve models using vLLM, DeepSpeed, Triton, TensorRT
Multi-Agent Orchestration:Work with LangChain, AutoGen, CrewAI, DSPy and similar tools
Cloud & MLOps (AWS):Deploy with SageMaker, Bedrock, Lambda, S3, ECS, EKS
Full-Stack Integration:Build APIs (FastAPI, Flask) and integrate with React, TypeScript, Node.js
Vector Search:Use FAISS, Weaviate, Pinecone, ChromaDB, OpenSearchRequired skills & experience:
3-5+ years of experience in ML engineering and software development
Deep Python proficiency, with PyTorch, TensorFlow or Hugging Face
Proven experience with LLMs, RAG, and deploying cloud-native AI on AWS
Strong full-stack skills (React, TypeScript, Node.js) and API development
Familiarity with vector databases and multi-agent frameworksApply now to join this high growth and award-winning organisation with the opportunity to be part of building the future of AI driven projects and solutions. The role offers a highly competitive salary and benefits package and will be office based in Manchester.
INDAMS