Elevate your engineering skills by joining a team of exceptionally gifted professionals and position yourself in the top echelon of site reliability.
As a Site Reliability Engineer at JPMorgan Chase within the Corporate Oversight and Governance (COG), Architecture & Engineering team, you will work collaboratively with stakeholders to define non-functional requirements (NFRs) and availability targets for your application and product lines. You will ensure these NFRs are incorporated into your products' design and testing phases, that your service level indicators effectively measure customer experience, and that service level objectives are defined with stakeholders and implemented in production. You will solve complex coding problems with a quality-driven, product-centric approach.
Corporate Oversight and Governance Technology (COGT) develops solutions supporting Compliance, Controls Management, Resiliency, Legal, Regulatory, and Audit functions. These solutions facilitate independent review, monitoring, and oversight of business operations, focusing on legal and regulatory obligations related to the firm's products and services.
Architecture and Engineering is a cross-functional group within Corporate Oversight and Governance Technology, focusing on engineering practices, architectural governance, and data management. It provides guidance, sets mandates, and delivers solutions.
Job responsibilities
1. Create high-quality designs, roadmaps, plans, standards, and program charters to be delivered by you, your team, or the wider COGT engineering community.
2. Promote a site reliability culture by demonstrating principles and practices daily and championing their adoption.
3. Collaborate to design and implement robust, stable observability and reliability solutions for complex systems that minimize toil and technical debt.
4. Design, create, and advocate for SRE products to scale the implementation of SRE best practices within COGT.
5. Evolve and debug critical components of applications and platforms.
6. Contribute to JPMorgan Chase's site reliability community through forums, communities of practice, guilds, and conferences.
7. Participate in architecting and building highly distributed systems and SRE products, solving complex coding problems.
8. Maintain and promote best practices in software engineering, leading by example.
Required qualifications, capabilities, and skills
* Applied experience with SRE concepts, strategies, and culture.
* Knowledge of observability tools such as OpenTelemetry (OTel), Grafana, Dynatrace, Prometheus, Datadog, and Splunk, including monitoring, SLIs, alerting, and telemetry collection.
* Proven experience in Java and Spring Boot.
* Competency in at least one of the following programming languages: Go, Python, TypeScript, or JavaScript.
* Familiarity with software design patterns relevant to reliability.
* Understanding of the software delivery lifecycle and related tooling, including branching and testing strategies.
* Experience developing containerized, serverless, and event-driven systems.
* Agile practitioner.
* Ability to anticipate, identify, and troubleshoot defects during testing.
About the Team
J.P. Morgan is a global leader in financial services, providing strategic advice and products to prominent clients worldwide. Our client-centric approach drives our success. We value diversity and inclusion, recognizing that our people are our strength. We are an equal opportunity employer and accommodate various needs and practices. For more information on accommodations, visit our FAQs.