An exciting opportunity to join a small, high-impact AI team building an internal AI Platform that powers the next generation of intelligent products and services.
You’ll work on cutting-edge AI infrastructure, collaborate with experienced platform engineers, and make a real impact on how AI innovation scales across the organization.
What you’ll be doing:
* Building and maintaining a Kubernetes-hosted AI platform (AKS)
* Deploying and managing LLMOps tools such as LiteLLM, Langflow, and Langfuse
* Implementing observability with Prometheus, Grafana, and Loki
* Managing infrastructure through Terraform, ArgoCD, and GitHub Actions
* Supporting internal AI applications including RAG, document processing, and internal AI assistants
What you’ll need:
* 2–4 years in Platform or DevOps Engineering (Azure preferred)
* Strong experience with Kubernetes, Docker, and Terraform
* Programming or scripting skills in Python or Go
* Familiarity with GitOps, Helm, and observability tools
* A learning mindset and interest in LLM operations