Job Description
In-scope Technology:
Google Cloud Platform infrastructure:
* Compute Engine
* Red Hat Enterprise Linux (RHEL) virtual machines
* Windows virtual machines
* Local SSD
* Persistent Disk
* Cloud Networking
* Cloud Logging
* Cloud Monitoring
* Cloud Storage
* Cloud Key Management Service (Cloud KMS)
* Secret Manager
* Dynatrace
* GitHub Enterprise
* HashiCorp products, including Terraform and Packer
* Jenkins
* Backstage
* Jira Cloud and Jira Align
Skillset Required:
* Google Cloud Platform Infrastructure Engineering: Proven experience designing, building, and operating secure, automated cloud platform capabilities, with a focus on Google compute products and services.
* Infrastructure as Code: Proficiency with Terraform (as a minimum) and modern CI/CD systems (GitHub Actions, Harness, Jenkins).
* Networking & Security: Experience with Google Cloud Armor and GCP networking, and with embedding secure-by-design controls from design through to runtime.
* Automation & Observability: Experience implementing actionable observability, performance tuning, and automation to reduce toil, and defining and operating against SLOs/SLIs.
* Scripting & Tooling: Scripting in Bash, PowerShell, or Python. Familiarity with HashiCorp Vault, Harness, and Backstage is desirable.
* Team Leadership, Collaboration & Mentoring: Ability to lead, motivate, and mentor a team of infrastructure engineers and to foster cross-team collaboration.
* Certifications: Relevant GCP certifications are desirable.
Scope of Services:
The scope of service for the infrastructure engineering lead within the Public Cloud Services Compute (GCP DCX) team includes:
* Design, Build, and Operate: Deliver and maintain secure, automated Google compute capabilities, supporting Red Hat Enterprise Linux (RHEL) and Windows virtual machines as well as broader compute and networking products and configurations.
* Platform Enablement: Enable product teams to deliver Google IaaS solutions at pace, leveraging reusable patterns and robust integration tools.
* Infrastructure Automation: Develop and maintain Infrastructure as Code (IaC) solutions for provisioning and managing Google resources, ensuring repeatability and compliance.
* Security & Compliance: Embed security best practices and controls throughout the platform life cycle, safeguarding organisational and customer data.
* Performance & Reliability: Define, monitor, and operate against service level objectives and indicators (SLOs/SLIs), ensuring high availability, performance, and fault tolerance.
* Continuous Improvement: Drive automation, observability, and performance tuning to reduce manual effort and improve platform reliability.
* Collaboration: Work closely with architecture and feature teams to evolve the cloud roadmap and platform products, contributing to documentation and enablement.
* Team Leadership, Mentoring & Standards: Lead, motivate, and mentor a team of infrastructure engineers and uphold engineering standards, fostering a culture of continuous learning and improvement.