Job Overview
Engineer the future of global finance. At Citi, our Tech team doesn't just support finance - we are helping to redefine it. Every day, $5 trillion moves through our network. We do business in 180+ countries, operating at a scale few can match. From deploying advanced AI to helping shape global markets, we build systems that matter. Join a team where your work helps influence economies, your ideas can drive innovation and outcomes, and your growth is backed by mentorship, continuous learning, and flexibility with potential hybrid work opportunities. Help solve real-world challenges that touch millions and build the future of finance with Citi Tech.
Responsibilities
Implement Enhanced Testing and Recovery
* Lead and govern the implementation and execution of Production Swing testing for critical applications, ensuring applications run from their alternate site for a minimum of 5 days.
* Drive the implementation and oversight of Data Recovery testing, ensuring applications can recover critical data from backup solutions within the defined Impact Tolerance (ITOL).
* Drive the onboarding of critical applications to the One‑Touch Recovery (OTR) orchestration solution.
* Develop and execute strategies to minimize the Recovery Time Actual (TRTA) for critical applications.
Design and Architecture
* Serve as a key champion for resilient application design, advocating for and integrating resiliency principles into architectures, and driving the adoption of established resiliency patterns.
* Leverage cloud‑native services and features to enhance application resiliency. This includes services for auto‑scaling, load balancing, and disaster recovery.
* Explore and implement chaos engineering practices to proactively identify and address system weaknesses under stress.
* Partner with IO owners and platform teams to expand OTR capabilities across diverse technology stacks through API development and integration.
Proactive Vulnerability Management
* Proactively identify vulnerabilities through regular architecture reviews, comprehensive scenario testing, and foundational testing.
* Document and demonstrate mitigation efforts for all discovered vulnerabilities. This includes developing remediation plans, implementing necessary changes, and validating the effectiveness of mitigations.
* Ensure that all identified vulnerabilities have remediation plans scheduled.
Operational Resilience Adherence
* Govern and ensure that all critical applications adhere to operational resilience testing and recovery requirements.
* Collaborate with relevant stakeholders to define and maintain appropriate impact tolerances for critical business services.
* Ensure adherence to regulatory requirements for operational resilience, including MAS, OCC, and other jurisdictional mandates.
Performance Measurement and Reporting
* Monitor and report on key resilience metrics, including the number of applications executing Production Swing tests, the number of applications on One‑Touch Recovery, recovery times, and adherence to operational resilience requirements.
* Provide regular updates to senior management on the status of resilience initiatives and key performance indicators.
* Drive the development of resiliency dashboards and self‑service reporting capabilities to provide transparency into program progress and application resiliency posture.
Key Qualifications
* Experience in software engineering, site reliability engineering (SRE), or technology risk and controls.
* Experience in a program or project management role, delivering complex, cross‑functional technology initiatives.
* Proven expertise in analyzing complex application, database, network, and OS issues across distributed, large‑scale, customer‑facing systems.
* Strong understanding of resiliency principles, including disaster recovery, data recovery, and high‑availability architecture.
* Excellent communication skills and a proven ability to work effectively across multiple business and technical teams.
* Bachelor's degree in Computer Science, Engineering, or an equivalent field.
Benefits
* 27 days annual leave (plus bank holidays)
* Discretionary annual performance-related bonus
* Private Medical Care & Life Insurance
* Employee Assistance Program
* Pension Plan
* Paid Parental Leave
* Special discounts for employees, family, and friends
* Access to an array of learning and development resources
Job Family Group: Technology
Job Family: Applications Support
Time Type: Full time
Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.
If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity, review Accessibility at Citi.
View Citi's EEO Policy Statement and the Know Your Rights poster.