Social network you want to login/join with:
Site Reliability Engineer III - Corporate Oversight and Governance Technology, London
Client:
Location: London, United Kingdom
Job Category: Other
-
EU work permit required: Yes
Job Reference: 2ca1828910c1
Job Views: 2
Posted: 17.05.2025
Expiry Date: 01.07.2025
Job Description:
Elevate your engineering skills by joining a team of talented professionals and positioning yourself among the top in site reliability.
As a Site Reliability Engineer at JPMorgan Chase within the Corporate Oversight and Governance (COG), Architecture & Engineering team, you will collaborate with stakeholders to define non-functional requirements (NFRs) and availability targets for your application and product lines. You will ensure these NFRs are incorporated into your products’ design and testing phases, that your service level indicators effectively measure customer experience, and that service level objectives are set with stakeholders and implemented in production. You will solve complex coding problems with a quality-driven, product-centric approach.
Corporate Oversight and Governance Technology develops solutions supporting Compliance, Controls Management, Resiliency, Legal, Regulatory, and Audit functions. These solutions aid in independent review, monitoring, and oversight of business operations, focusing on legal and regulatory obligations related to the firm’s products and services.
Architecture and Engineering is a cross-functional group within Corporate Oversight & Governance Technology, covering engineering practices, architectural governance, and data management, providing guidance, setting mandates, and delivering solutions.
Job responsibilities
* Contribute to creating high-quality designs, roadmaps, plans, standards, and program charters that are delivered by you, your team, or the wider COGT engineering community.
* Promote and demonstrate site reliability culture, principles, and practices daily, championing the adoption of site reliability engineering.
* Collaborate to create and implement observability and reliability designs for complex systems that are robust, stable, and minimize toil and technical debt.
* Design, create, and advocate for SRE products to scale the implementation of SRE best practices within COGT.
* Develop and debug critical components of applications and platforms.
* Engage with JPMorgan Chase’s site reliability community through forums, communities of practice, guilds, and conferences.
* Participate in architecting, designing, and building highly distributed systems and SRE products, solving complex coding problems.
* Maintain and promote best practices in software engineering, leading by example.
Required qualifications, capabilities, and skills
* Applied experience with SRE concepts, strategies, and culture.
* Knowledge of observability tools such as OTEL, Grafana, Dynatrace, Prometheus, Datadog, Splunk, including monitoring, service level objectives, alerting, and telemetry collection.
* Proven experience with Java and Spring Boot.
* Familiarity with software design patterns applicable to reliability.
* Understanding of the software delivery lifecycle and associated tooling, including branching and testing strategies.
* Experience with developing containerized, serverless, and event-driven systems.
* Ability to anticipate, identify, and troubleshoot defects during testing.
J-18808-Ljbffr