Site Reliability Engineer (SRE) - Payments London, England, United Kingdom Software and Services
SRE and Engineering Operations Engineers in the team take part in every aspect of the software development lifecycle. We work in a fast-paced environment and are responsible for hands-on coding of critical system components. We work closely with privacy and security engineering teams to ensure that the products we build go above and beyond on both fronts. We also partner closely with quality and testing teams, and understand that their success is ours as well. Onboarding will be easier for you if you have hands-on experience with Java or another JVM-based language, and experience developing highly available, high throughput, distributed systems. Some other tech that’s relevant to us is workflow orchestration, relational and non-relational databases, message queueing, application container orchestration, and cloud deployment.
Production Experience in operationalizing large scale distributed, fault-tolerant, multi-tenant services.
Experience building systems both on-premise (data center) and on public cloud (AWS, GCP, or Azure welcome).
Understanding of core SRE concepts - Monitoring, Alerting, Incident management.
Strong background in leading multi-functional projects.
Experience handling large numbers of diverse systems with configuration management systems like Puppet, Chef, Ansible, or Salt.
#