AI Infrastructure / AI Operations Engineer (Contract)
Location: Edinburgh, UK
Onsite: 3 days per week (mandatory); remaining days remote
Start: ASAP
Duration: 12-24 months (extension very likely)
Language: English (must-have)
What you’ll do
* Build and operate AI / ML infrastructure used in production
* Support model deployment, monitoring and scaling
* Automate workflows around training, evaluation and deployment
* Work with GPU-based systems, distributed compute and CI/CD pipelines
* Partner closely with data scientists and engineers to keep AI systems stable and fast
What you bring
* Strong background in AI Ops, MLOps, DevOps or Infrastructure Engineering
* Hands-on experience with Linux, automation and scripting (Python/Bash)
* Experience with distributed systems and compute-heavy environments
* Familiarity with containers & orchestration (Docker, Kubernetes or similar)
* Comfortable working onsite in Edinburgh 3 days per week
Nice to have
* GPU / CUDA experience
* Exposure to HPC or large-scale AI platforms
* Experience with monitoring and observability tools (Prometheus, Grafana, etc.)