Role Title: Platform Engineer
Location: Milton Keynes
Duration: Until 31/07/2026
On‑site Requirement: 2 days per week
Role Overview
We are seeking a highly skilled Platform Engineer to operate, improve, and scale our Kubernetes platforms across AWS, Azure, and on‑prem environments. You will drive platform reliability, lead incident response, embed automation, and ensure strong security and governance across our estate.
What You’ll Do
1. Manage and enhance Kubernetes platforms across hybrid cloud environments.
2. Lead incident response, problem management, and root‑cause analysis.
3. Deliver full cluster lifecycle activities: upgrades, patching, node pools, CNI/CSI, ingress, and Rancher operations.
4. Own observability, dashboards, alerting, and SLO/SLI management.
5. Implement GitOps (Fleet) and automate to reduce operational toil.
6. Apply secure API gateway and WAF patterns.
7. Support distributed system design including event brokers and async messaging.
8. Maintain platform security: CVE remediation, GRC controls, and CI/CD scanning.
Your Skills
9. Strong expertise in Kubernetes, Rancher, GitOps, Linux, and cloud networking.
10. Solid understanding of API gateway and WAF architectures.
11. Experience working with distributed systems and event‑driven patterns.
12. Proficient in automation/scripting (Python, Go, Bash, PowerShell, .NET).
13. IaC experience with Terraform for foundational cluster provisioning and Crossplane for orchestration leveraging Terraform providers.
14. Ability to operate securely within PCI DSS and GDPR frameworks.
15. CI/CD tooling knowledge: Concourse, GitHub Actions, Azure DevOps.
16. Observability expertise: Grafana, Prometheus, Jaeger/Tempo, CloudWatch, Loki, OpenTelemetry.
Nice to Have
17. Hands‑on AWS operational experience.
18. Service mesh exposure (Istio/Kuma).
19. Hybrid cloud experience (AWS + Azure + on‑prem).
20. Background in payments or regulated industries.
What Success Looks Like
21. High levels of platform uptime and stability.
22. Reduced operational effort through automation and GitOps.
23. Predictable, smooth upgrade and maintenance cycles.
24. Lower CVE counts and reduced platform risk.
25. Faster and more streamlined onboarding of new tenants and workloads.
If this role interests you, please apply.
#4773185 - Komal Meenu