Job Description
AI Infrastructure / AI Operations Engineer (Contract)
Location: Edinburgh, UK
Onsite: 3 days per week – mandatory / Remote
Start: ASAP
Duration: 12-24 months (extension very likely)
Language: English (must-have)
What you’ll do
1. Build and operate AI / ML infrastructure used in production
2. Support model deployment, monitoring and scaling
3. Automate workflows around training, evaluation and deployment
4. Work with GPU-based systems, distributed compute and CI/CD pipelines
5. Partner closely with data scientists and engineers to keep AI systems stable and fast
What you bring
1. Strong background in AI Ops, MLOps, DevOps or Infrastructure Engineering
2. Hands-on experience with Linux, automation, scripting (Python/Bash)
3. Experience with distributed systems and compute-heavy environments
4. Familiarity with containers & orchestration (Docker, Kubernetes or similar)
5. Comfortable operating onsite in Edinburgh 3x/week
Nice to have
1. GPU / CUDA experience
2. Exposure to HPC or large-scale AI platforms
3. Monitoring & observability tools (Prometheus, Grafana, etc.)
...