Salary: £45,000 - 70,000 per year Requirements:
* Relevant academic (research Masters level) and/or industry experience
* Excellent knowledge and experience of managing an on-premise Kubernetes cluster
* Excellent knowledge of Kubeflow and similar systems, e.g. MLflow
* Good programming ability in Python with familiarity with Linux systems including scripting and system configuration
* Experience using AWS, e.g., Cognito, S3, EC2, Lambdas, etc.
* Experience with ML toolkits, e.g. PyTorch, Lightning, etc., with a solid understanding of how these fit into ML Ops pipelines and tools
* Ability to design and implement MLOps solutions covering many different technologies
* Background in DevOps with exposure to CI systems, e.g. Jenkins (desirable)
* Familiarity with infrastructure as code, e.g. Ansible (desirable)
* Experience, aptitude, and a desire to work with human motion capture, sport, animation tools and techniques (desirable)
* Familiarity with C++ (desirable)
Responsibilities:
* Join the ML Operations team to support the ML Development team in building leading-edge motion capture products
* Provision and maintain a modern ML Operations stack, covering data acquisition pipelines, data management, and ML model training infrastructure (SW and on-prem HW)
* Utilize both on-prem, self-managed systems and leverage AWS infrastructure
* Guide the technical direction of the ML Ops team and suggest new areas of development
* Lead your own project when opportunities arise
Technologies:
* AWS
* Ansible
* Cloud
* DevOps
* EC2
* Support
* Jenkins
* Kubeflow
* Kubernetes
* Linux
* PyTorch
* Python
More:
We are excited to invite an excellent ML Ops Engineer to join our research and development team, where you will play a pivotal role in our innovative ML Operations efforts. Our team focuses on building cutting-edge motion capture products and offers a collaborative environment that thrives on creativity and technical expertise. We provide a range of benefits, including opportunities for professional growth, and work with both on-prem and cloud infrastructure in a dynamic location.
last updated 1 week of 2026