An exceptional opportunity for a Machine Learning Engineer (with Full-Stack experience) to join an innovative market leader at the forefront of developing next-generation solutions that transform digital interactions. The role will focus on projects to leverage state-of-the-art generative AI, retrieval-augmented generation (RAG), and reasoning frameworks to build intelligent and context-aware systems.
We are seeking talented Machine Learning Engineers with full-stack software development experience to join our client's team and help shape the future of AI-powered automation. Within this dynamic role varied duties will include:
Search relevancy engineering.
Conversational AI Development: Design, train, fine-tune, and deploy LLMs with reasoning capabilities.
Retrieval-Augmented Generation (RAG): Implement, optimise, and scale RAG pipelines for effective information retrieval from structured and unstructured sources.
Model Fine-Tuning & Training: Train domain-specific models using techniques like LoRA, QLoRA, PEFT, reinforcement learning, and supervised fine-tuning (SFT).
Model Deployment & Inferencing: Optimise model serving and inference using vLLM, DeepSpeed, TensorRT, Triton, and other acceleration frameworks.
Multi-Agent Systems: Develop and integrate agentic capabilities using frameworks such as LangChain, CrewAI, AutoGen, and DSPy.
AWS Cloud & MLOps: Deploy scalable machine learning workloads on AWS using services like SageMaker, Bedrock, Lambda, S3, DynamoDB, ECS, and EKS.
End-to-End AI Product Development: Work across the full ML lifecycle, from data collection and preprocessing to model evaluation, deployment, and monitoring.
Full-Stack Integration: Develop APIs and integrate ML models into web applications using FastAPI, Flask, React, TypeScript, and Node.js.
Vector Databases & Search: Implement embeddings and retrieval mechanisms using Pinecone, Weaviate, FAISS, Milvus, ChromaDB, or OpenSearch.Required skills & experience:
3-5+ years in machine learning and software development
Proficient in Python, PyTorch or TensorFlow or Hugging Face Transformers
Experience with RAG, LLM fine-tuning, and expertise in AWS and cloud-native AI deployments.
Full-stack experience (React, TypeScript, Node.js) and API development.
Familiarity with vector search and multi-agent orchestrationApply now to join this high growth and award-winning organisation with the opportunity to be part of building the future of AI driven projects and solutions. The role offers a highly competitive salary and benefits package and will be office based in Leicestershire.
MLE(phone number removed)AM
INDAM