Own and establish the cloud and DevOps foundation for a production‑grade Agentic IVR platform, including GCP project setup, environment isolation, CI/CD pipelines, security guardrails, observability, resilience, and operational readiness.
This role is focused on platform, deployment, and SRE enablement rather than application or AI model development.
Key Responsibilities
* Provision and manage multiple GCP projects aligned to DEV / INT / PRE‑PROD / PROD / Canary environments
* Implement strict environment isolation, IAM boundaries, service accounts, and VPC design
* Enforce UK/EU region constraints, network egress controls, and private connectivity
* Set up sandbox projects for CXAS (Google CX Agent studio and Dialogflow) agent with no bank system connectivity
Infrastructure as Code & Guardrails
* Own Terraform‑based IaC for project creation, IAM, networking, logging, secrets, and service enablement
* Implement policy‑as‑code guardrails and ensure immutable promotion across environments
* Prevent manual drift and enforce compliance through automated controls
* CXAS agent artefacts and configuration promotion
* Middleware services (GKE / Cloud Run / OCP)
* Infrastructure pipelines with approval gates
* Embed security and quality checks into pipelines
* Support canary, shadow, dual‑run, and rollback deployment strategies
Observability, SRE & Production Readiness
* Define and implement centralised logging, metrics, and distributed tracing
* Establish SLOs, SLIs, and error budgets aligned to call success and latency
* Define alerting, runbooks, incident response, and operational handover processes
* Lead production readiness and resilience reviews
Security, Compliance & Governance
* Embed DevSecOps practices aligned to banking controls
* Implement secrets management, audit logging, and evidence capture
* Support Responsible AI operational controls through traceability and logging
Cross‑Functional Collaboration
* Act as DevOps/SRE lead across Voice AI, Middleware, Testing, and Governance streams
* Contribute to deployment strategy, RACI, and integrated programme planning
Required Skills & Experience
* Strong hands‑on experience with Google Cloud Platform (project/IAM/networking/logging/security)
* Deep expertise in Terraform and Infrastructure as Code (including docker, k8s manifests, helm charts)
* CI/CD tooling experience (GitHub Actions, Jenkins, Argo CD, Harness or equivalent)
* Kubernetes (GKE and/or OpenShift) and containerised deployments
* Experience implementing canary, blue/green, and dual‑run deployment strategies
* Strong SRE mindset with experience in SLOs, observability, resilience, and incident management
* Experience in regulated banking or financial services environments
* Exposure to contact‑centre, IVR, or telephony‑integrated platforms
* Operational experience supporting AI or agent‑based platforms (non‑model development)
#J-18808-Ljbffr