Overview
As an AI Platform Engineer, you’ll be part of the team designing and running the core AI platform. You’ll work on APIs, pipelines, observability, security, and orchestration, ensuring our AI solutions can move from experiment to production smoothly. This role is about building the foundations of AI adoption — if you enjoy combining distributed systems, cloud engineering, and AI tooling into something bigger, this is it.
You should take pride and ownership in your work, share your expertise, and be open to feedback. You’ll be expected to collaborate, coach others, and continuously improve the technology, architecture, and people you interact with. You should be able to drive excellence and safety in your team and embed these values at all levels.
What You Will Do
* Design and build the AI platform that powers LLMs, agents, and other AI solutions across Dojo.
* Develop APIs, SDKs, and tooling that allow product teams to consume AI capabilities at scale while delivering a great developer experience.
* Implement orchestration for multi-model and multi-service workflows (e.g., LangGraph, Crew AI, Google Agent Development Kit, agentic frameworks).
* Build and manage vector search and retrieval systems to support RAG and knowledge integration.
* Build robust monitoring, logging, and guardrails to ensure AI systems are safe, observable, and compliant using solutions like Langsmith, Opik, Prometheus and Grafana.
* Automate infrastructure and model deployment with Kubernetes, Terraform, and CI/CD pipelines.
* Partner with security, compliance, and product teams to ensure safe use of AI in production.
* Stay current with AI platform trends, open-source tools, and emerging patterns — bring best practices into our stack.
What You’ll Bring
* Strong software engineering or platform engineering background (Python/Go/Java; Go/Java/.NET a bonus).
* Solid experience with distributed systems, microservices, and cloud-native architecture (GCP preferred).
* Hands-on experience with Kubernetes, service mesh, and event-driven systems.
* Familiarity with LLM orchestration frameworks (LangChain, LangGraph, CrewAI, GCP ADK or similar).
* Experience with vector databases (FAISS, Pinecone, Weaviate, Vertex Vector Search) and RAG pipelines.
* Knowledge of MLOps/AI infra tools (MLflow, VertexAI, Ollama, OpenRouter etc).
* Strong CI/CD and infrastructure-as-code skills (Terraform, Helm, etc.).
* Good understanding of AI governance, monitoring, and responsible AI practices.
* Comfort balancing speed (PoCs) with robustness (production-ready systems).
Seniority level
* Mid-Senior level
Employment type
* Full-time
Job function
* Engineering and Information Technology
Industries
* Advertising Services
#J-18808-Ljbffr