Job Description
About Us
We are an AI-native, VC-backed startup building a multimodal, proprietary foundation model with a profound understanding of retail, designed to hyper-personalise every shopper touchpoint. As we scale from research to production, we need robust infrastructure that makes our models reliable, reproducible, and observable at scale.
As a Senior MLOps Engineer, you will own the infrastructure and tooling that turns experimental models into dependable production systems. You will build the pipelines, monitoring, and deployment workflows that allow our Research Engineers to move fast without breaking things. If you want to operate at the intersection of machine learning and production systems engineering, this role is for you.
What You Will Do
1. Build and maintain CI/CD pipelines for model training, evaluation, and deployment across research, staging, and production environments.
2. Design and implement model registries, versioning systems, and experiment tracking to ensure full reproducibility of all model releases.
3. Deploy ML workflows using tools like Airflow or similar, managing dependencies from data ingestion through model deployment and serving.
4. Instrument comprehensive monitoring for model performance, data drift, prediction quality, and system health.
5. Manage infrastructure as c...