We are seeking a highly skilled Dev Ops Engineer to join our growing platform team. This role is critical in designing, building, and maintaining our robust, scalable, and secure cloud platform, which supports our core applications and AI/ML models. The ideal candidate will have deep expertise in cloud technologies (Azure), Infrastructure as Code (IaC), container orchestration, and CI/CD pipelines. An understanding of MLOps and data pipeline tools is also essential for integrating with our AI components.
Core Responsibilities
Cloud Platform & Infrastructure Management
* Design, implement, and manage secure and high-performance cloud infrastructure on the Microsoft Azure ecosystem.
* Utilize Terraform to automate the provisioning, configuration, and management of cloud resources, ensuring consistency and reliability.
* Optimize cloud costs and performance through best practices in resource management and architecture.
Container Orchestration
* Lead the deployment, scaling, and management of applications and services using Kubernetes and containerisation technologies like Docker.
* Manage infrastructure and resource allocation for all containerised applications and data systems.
CI/CD and Automation
* Develop and maintain robust CI/CD pipelines to automate the build, test, and deployment of both our front-end and back-end services.
* Implement monitoring, logging, and alerting solutions to ensure the health and performance of our platform.
* Promote a culture of automation across the development lifecycle.
MLOps and Data Pipeline Integration
* Collaborate with data scientists and engineers to support the operationalization of machine learning models.
* Possess a strong understanding of data pipeline orchestration tools such as Apache Airflow and their integration with cloud infrastructure.
* Assist in the deployment and monitoring of self-hosted LLMs.
Cross-Functional Collaboration:
* Work closely with stakeholders, including data scientists, software engineers, and business analysts, to align technical solutions with business needs.
* Mentor junior engineers and foster a culture of continuous learning within the team.
Skills / Qualifications
Education
* Bachelor's/Master’s/PhD degree in Computer Science, Engineering, or a related field; or equivalent professional experience.
Experience
* 5+ years of experience in DevOps, Cloud Engineering, or a related field.
* Proven track record of designing and managing cloud infrastructure on Microsoft Azure.
* Demonstrated expertise with Terraform for infrastructure provisioning.
* Hands-on experience with Kubernetes for container orchestration in a production environment.
* Experience with building and managing CI/CD pipelines (e.g., Jenkins, GitHub Actions, Azure DevOps).
* Familiarity with MLOps practices and tools including Apache Airflow is a significant plus.
Technical Skills:
* Proficiency in programming languages such as Bash and Python
* Strong understanding of version control systems (e.g., Git).
* Solid knowledge of networking, security, and cloud-native services.
Soft Skills:
* Strong problem-solving skills and the ability to work in a fast-paced, collaborative environment.
* Excellent communication and documentation skills.
* Strong analytical mindset with attention to detail.
Company Overview
Element is one of the fastest growing testing, inspection and certification businesses in the world. Globally we have more than 9,000 brilliant minds operating from 270 sites across 30 countries. Together we share an ambitious purpose to ‘Make tomorrow safer than today’.