Description

The Software Assurance Group is seeking an exceptional engineer for the role of Principal Cloud Engineer, with a critical focus on supporting the Machine Learning Engineering team. In this role, you will collaborate closely with ML engineers to design and implement the architecture for advanced network and platform tools that directly enhance the security, observability, and scalability of large-scale, global AI systems. We are developing robust security and quality tools that deliver automated assurance, enable continuous compliance, and provide integrated protection for AI-driven solutions deployed in production.

You will take a hands-on technical leadership role, designing and building cloud-native services tailored to the operational and security needs of machine learning environments, and guiding the team that develops them. This includes solutions for secure transport and processing of ML artifacts (such as models, intermediate code, and SBOMs), scalable data collection and telemetry pipelines for real-time monitoring, and orchestration frameworks that efficiently manage infrastructure resources for dynamic ML workloads.

Working across multi-tenant, global deployments, your architecture will enable the team to monitor, secure, and optimize thousands of ML-powered applications in parallel while maintaining strict adherence to compliance and privacy requirements.

Responsibilities:

- Design and implement scalable, highly available cloud services, management consoles, and robust telemetry/observability.
- Integrate modern security tools and orchestrate large-scale workloads (using Kubernetes or similar frameworks).
- Automate deployments and infrastructure (e.g., via Terraform, Ansible) and enable CI/CD integration.
- Optimize system performance and costs.
- Support SLAs/SLOs and drive the response to critical production issues.
- Champion security-first, compliant architectures and collaborate globally with engineering and ML teams.

Minimum Qualifications:

- BS degree in computer science, software engineering, or a related field.
- 8 years of experience with distributed/cloud systems on a major cloud platform (OCI, AWS, GCP, Azure).
- Proficiency with programming languages including Java, Python, Go, C/C++.
- Expertise in multi-tenant cloud applications, microservices, APIs, and Infrastructure-as-Code.
- Strong background in CI/CD, Linux, security design (including compliance), and working with global teams.
- Demonstrated experience with observability, scalability, and cost optimization in production.
- Eligibility to work in the United Kingdom without sponsorship is essential.

Preferred Qualifications:

- MS or PhD degree in computer science, software engineering, or a related field.
- Direct experience with Oracle Cloud Infrastructure (OCI).
- Experience with security and quality tools.
- Exposure to ML/data engineering platforms and experience collaborating with ML teams.
- Experience with large-scale orchestration and monitoring (e.g., Kubernetes, Prometheus, Airflow) and with disaster recovery.
- Background in application or cloud security and in leading cross-functional technical projects.
- Experience mentoring junior staff.

Qualifications

Career Level - IC4