Lead MLOps Engineer - London - Permanent
📍 London, UK
(If you like the sound of this role and want to relocate - the Client is willing to help facilitate this move!)
This is a high-impact role within a fast-growing AI and robotics organisation focused on building advanced, scalable intelligent systems for real-world industrial applications. The position owns the machine learning infrastructure and MLOps foundations as products, platforms, and teams scale.
You will play a key role in transforming machine learning prototypes into reliable production systems, defining pragmatic engineering standards, and enabling fast, safe delivery of ML-powered capabilities. The role combines hands-on engineering, architectural ownership, and close collaboration with engineering and product teams.
Key Responsibilities
Own and scale the organisation's ML infrastructure and MLOps foundations
Design pragmatic, production-ready system architectures that balance speed, reliability, and cost
Build and maintain CI/CD pipelines for ML workflows and application delivery
Productionise ML models including training, evaluation, deployment, monitoring, and rollback strategies
Ensure reliability, observability, security, and performance across ML systems
Automate infrastructure provisioning, deployments, and environment management using cloud-native tooling
Partner closely with ML engineers, software engineers, and product teams to deliver ML features end-to-end
Act as a technical leader through design reviews, mentorship, and by establishing engineering best practices
Required Experience & Skills
Staff or lead-level experience in MLOps, DevOps, or Infrastructure Engineering, ideally within high-growth or startup environments
Strong Python skills with hands-on experience using modern ML frameworks (e.g., PyTorch, TensorFlow, or similar)
Experience working with major cloud platforms (AWS, GCP, or Azure)
Proven production experience with Docker and Kubernetes
Strong understanding of CI/CD systems (e.g., GitHub Actions, GitLab CI, ArgoCD)
Experience with Infrastructure as Code tools such as Terraform and Helm
Solid understanding of data engineering fundamentals and ML lifecycle management
Ability to design scalable systems without unnecessary complexity
Strong debugging and problem-solving skills in distributed systems
Ownership mindset with excellent communication and cross-functional collaboration skills
What's Offered
Competitive salary and equity participation
Paid vacation in line with local labour regulations
Opportunities for international collaboration and travel
Office benefits including meals, snacks, and team events
If you are interested - please apply directly!
Randstad Technologies Ltd is a leading specialist recruitment business for the IT & Engineering industries. Please note that due to a high level of applications, we can only respond to applicants whose skills & qualifications are suitable for this position. No terminology in this advert is intended to discriminate against any of the protected characteristics that fall under the Equality Act 2010. For the purposes of the Conduct Regulations 2003, when advertising permanent vacancies we are acting as an Employment Agency, and when advertising temporary/contract vacancies we are acting as an Employment Business