Job Description: GCP Platform Engineer / SRE
1. In‑Scope Technologies / Products
* GCP Cloud
* GCP API Management (Apigee) – Infrastructure
* GCP Networking
* GCP Cloud Armor
* GCP Load Balancer
* GCP Kubernetes Service (GKE)
* GCP Storage
* HashiCorp Terraform
* HashiCorp Vault
* Harness
* Containers
* Backstage
2. Skillsets Required
Cloud Platform Engineering
* Proven experience designing, building, and operating secure, automated cloud platform capabilities with a focus on Azure (with readiness to cross‑train in GCP as required).
Infrastructure as Code (IaC)
* Strong proficiency with Terraform (minimum).
* Experience with modern CI/CD systems such as GitHub Actions, Harness, and Jenkins.
API Management
* Deep understanding of GCP API Management (Apigee) infrastructure and API Gateway solutions.
* Familiarity with API design and security standards (REST/OpenAPI, authentication/authorisation, mTLS, certificate lifecycle management).
Networking & Security
* Hands‑on experience with GCP Cloud Armor, GCP Networking, and implementation of secure‑by‑design controls across platform lifecycle.
Containers & Orchestration
* Experience with GKE, container runtimes, and service mesh patterns (e.g., Istio).
Automation & Observability
* Ability to implement actionable observability solutions.
* Performance tuning and automation to reduce toil.
* Experience defining and operating to SLOs/SLIs.
Scripting & Tooling
* Proficiency in Bash, PowerShell, or Python.
* Familiarity with HashiCorp Vault, Harness, and Backstage is desirable.
Collaboration & Mentoring
* Ability to mentor engineers, contribute to communities of practice, and uphold platform engineering standards.
Certifications
* GCP certifications are desirable.
3. SRE Role – Additional Requirements
* Relevant GCP certifications aligned to the service.
* Strong DevOps background with experience in IaC and CI/CD pipelines (Terraform, Jenkins).
* Ability to quickly understand, update, and write code (Python, Groovy, Bash, PowerShell).
* Demonstrable expertise in modular, maintainable IaC design.
* Strong understanding of cloud security, networking, and APIs.
* Excellent troubleshooting and problem‑solving capability.
* Hands‑on experience with observability tooling (including Observability‑as‑Code and SLO‑based Dynatrace monitoring).
* Strong understanding of source‑control practices and collaborative engineering.
* Experience developing and administering Kubernetes clusters in production.
* Strong focus on automation and toil reduction.
* Good knowledge of incident management and issue resolution processes.
* Demonstrated passion for continuous learning and engineering excellence.
4. Scope of Services
As Infrastructure Engineers within the Integration GCP team, responsibilities include:
Design, Build & Operate
* Deliver and maintain secure, automated GCP API Management platform capabilities, supporting both API Gateway and broader integration products.
Platform Enablement
* Enable product teams to deliver API‑first services using reusable patterns and robust integration tools.
Infrastructure Automation
* Develop and maintain IaC solutions for provisioning and managing Azure and GCP resources, ensuring consistency, repeatability, and compliance.
Security & Compliance
* Embed security best practices throughout the platform lifecycle to safeguard organisational and customer data.
Performance & Reliability
* Define, monitor, and operate against SLOs/SLIs to ensure availability, performance, and resilience.
Continuous Improvement
* Drive automation, observability enhancement, and performance tuning to reduce manual effort and improve reliability.
Collaboration
* Work closely with architecture and feature teams to evolve the cloud roadmap and platform products.
* Contribute to documentation, technical enablement, and engineering standards.
Mentoring
* Mentor team members and promote a culture of continuous learning.
5. SRE Role – Additional Service Responsibilities
* Hands‑on engineering ownership of IaC and CI/CD pipeline‑based products and services.
* Respond to change requests, implement enhancements, and improve reliability and customer experience.
* Observe, investigate, and resolve service issues using an engineering mindset – resolving issues through code, automation, and root‑cause elimination.
* Implement automation to remove manual toil by leveraging existing cloud tooling or integrating new technologies.