Location: Nottingham Trent House, Nottingham, Nottinghamshire, United Kingdom.
Title: Senior Software Development Engineer - SRE
About the Role
We're looking for a Senior Engineer to join our Site Reliability Engineering (SRE) team. This role is ideal for a skilled Java engineer with a passion for understanding how complex systems work, analysing performance, and applying engineering solutions to make them more efficient, stable, and scalable. You'll lead on planning and implementing key SRE initiatives, optimise and automate how our systems operate, and improve observability through better monitoring and logging. You'll also work closely with your peers to drive consistency and high standards across SRE and the wider engineering community, so a real enthusiasm for influencing others and leading by example is essential.
What you’ll do
* You will technically lead a group of engineers responsible for maintaining and improving our back‑end services for our most critical customer and/or data applications within the cloud.
* Through your team and hands‑on engineering, help to shape and deliver the goals and vision set out by your Site Reliability Engineering Manager (SREM).
* Contribute to the major optimisation and improvement themes within the team. Identifying opportunities to reduce operational overheads through observability and service automation.
* Drive engineering best practice (e.g., Operational Excellence, Security, Quality, Resilience etc.) and set standards across the team and wider SRE community.
* Innovate within your team and contribute within your technical domain.
* Deliver key pieces of intent from inception through to design and hands‑on delivery, in collaboration with your SREM.
* Estimate, scope and plan your work within an agile environment.
What we’re looking for
* Production‑grade experience of developing and managing APIs using Java & Springboot. Excellent understanding of SOLID principles, SOA, HTTP and REST.
* Good knowledge of Maven or Gradle.
* Production experience running microservices on container orchestration architectures.
* Passionate about API testing utilising tools similar to Wiremock, Pact, Postman and Cucumber.
* Good understanding of API authentication and authorisation mechanisms like OAuth.
* Knowledge using monitoring & logging tools to troubleshoot problems in a distributed network of systems.
* Excellent understanding of infrastructure as code and CI/CD pipeline management via Jenkins.
* Development experience within a cloud environment.
* Strong experience in back‑end architectural and design patterns.
* Able to effectively communicate and partner across engineering to maximise inner‑sourcing opportunities and reduce waste.
* Extensive knowledge to successfully identify, troubleshoot and pre‑empt operational issues.
* Proactive approach to problem determination, identifying areas for improvement and system bottlenecks.
* Strong communication skills with proven ability to challenge and motivate technical teams.
What you’ll get to learn
* Broadening your knowledge of the cloud (specifically AWS).
* Scripting languages (e.g. Python, BASH).
* Site Reliability Engineering.
* Solving real world problems and being comfortable working in a complex regulated environment.
Where and how you'll work
* This is a permanent position based in our Nottingham office.
* We have a hybrid working model which gives you flexibility to work from our offices and from home.
* We’re big on collaboration and connection, so you’ll be based in our Nottingham office 3 days a week on Tuesdays, Wednesdays and Thursdays.
Capital One is committed to diversity in the workplace.
#J-18808-Ljbffr