Senior Site Reliability Engineer – Moneycorp
Moneycorp is a global payments ecosystem transforming from a consumer‑focused foreign exchange provider to a cloud‑native payment infrastructure. This role plays a key part in shaping the future of our payments and FX platforms.
Key Responsibilities
* Define and maintain SLOs/SLIs and error budgets for critical services.
* Build and improve observability pipelines (metrics, logs, traces) and maintain dashboards for golden signals.
* Develop incident runbooks, lead post‑incident reviews, and drive root‑cause analysis.
* Implement anomaly detection and predictive monitoring, and forecast capacity for cloud workloads.
* Automate backup, restore, and failover processes, and validate RTO/RPO through regular DR testing.
* Design and run chaos engineering experiments and enhance self‑healing automation.
* Lead SEV‑1/SEV‑2 incidents, authorize critical decisions, and eliminate toil through automation.
* Map dependencies for key business services, conduct scenario‑based resilience testing, and produce compliance evidence.
* Identify platform reliability issues, refactor the affected components, engineer modern replacements, and lead migrations with measurable outcomes.
Skills, Qualifications and Experience Required
* 7+ years in SRE, platform, or systems roles with production ownership of high‑availability, low‑latency platforms.
* Deep experience with Azure services (IaaS, AKS, VNets, App Gateway, SQL, Service Bus, Event Hubs, Kafka, Key Vault) and IaC with Terraform.
* Strong background in security‑by‑design, Zero Trust principles, and regulatory compliance.
* Experience with Azure DevOps or GitHub Actions for CI/CD pipelines.
* Hands‑on knowledge of Prometheus, Grafana, OpenTelemetry, and alerting policies.
* Experience with FinOps practices, cost optimization, and cloud commercials.
* Experience leading SEV‑1/SEV‑2 incident management and delivering post‑mortems.
* Experience designing and validating disaster recovery, chaos engineering, and automated resilience testing.
* Proficient in Windows Server (2019/2022/2025) and Linux (RHEL/Ubuntu) on Azure IaaS.
* Familiarity with payments orchestration, FX workflows, and platform refactoring for scale and resilience.
* Understanding of UK regulatory expectations (FCA/PRA) for operational resilience and scenario testing.
Desirable (not essential)
* Experience with Temenos or similar core banking platforms.
Education
* Bachelor’s degree in Computer Science, Engineering, or a related technical discipline, or equivalent hands‑on experience.
* Optional certifications: Microsoft Azure AZ‑104, AZ‑400, AZ‑700; Kubernetes CKA/CKAD; HashiCorp Terraform Associate.
How to Apply
If the role sounds like you, we invite you to upload a copy of your CV by clicking on the Apply Now button.
Equal Opportunity
We're committed to creating a workplace where every individual feels valued, respected, and included. As an Equal Opportunity Employer, we actively cultivate an inclusive culture where diversity thrives and empower our colleagues to drive meaningful change through initiatives like our DE&I focus groups and value champion network.