Engineer the future of global finance. At Citi, our Tech team does not simply support finance—we are helping to redefine it. Every day, $5 trillion crosses through our network and we operate in 180+ countries with a scale few can match. From deploying advanced AI to shaping global markets, we build systems that matter. Join a team where your work influences economies, your ideas drive innovation, and your growth is backed by mentorship, continuous learning and flexible hybrid work opportunities. Help solve real‑world challenges that touch millions and build the future of finance with Citi Tech.
As a Program Manager for Strategic Initiatives at the Vice President level, you will play a critical role in executing our firm‑wide strategies, focusing on enterprise resiliency and recoverability. You will lead and drive key workstreams, ensuring successful delivery of projects that enhance the resilience of our critical applications and support broader strategic goals. This role governs the implementation of enhanced testing, recovery, and reporting capabilities, keeping our critical business services within defined impact tolerances and minimizing client impact.
Responsibilities include:
Implement Enhanced Testing and Recovery:
* Lead and govern the implementation and execution of Production Swing testing for critical applications, ensuring the application runs from its alternate site for a minimum of 5 days.
* Drive the implementation and oversight of Data Recovery testing, ensuring applications can recover critical data from backup solutions within the defined Impact Tolerance (ITOL).
* Drive the onboarding of critical applications to the One‑Touch Recovery orchestration solution.
* Develop and execute strategies to minimize the Recovery Time Actual (TRTA) for critical applications.
Design and Architecture:
* Serve as a key champion for resilient application design, advocating for and integrating resiliency principles into architectures, and driving adoption of established resiliency patterns.
* Leverage cloud-native services and features to enhance application resiliency, including auto‑scaling, load balancing, and disaster recovery services.
* Explore and implement chaos engineering practices to proactively identify and address system weaknesses under stress.
* Partner with IO owners and platform teams to expand OTR capabilities across diverse technology stacks through API development and integration.
Proactive Vulnerability Management:
* Proactively identify vulnerabilities through regular architecture reviews, comprehensive scenario testing, and foundational testing.
* Document and demonstrate mitigation efforts for all discovered vulnerabilities, developing remediation plans, implementing necessary changes, and validating effectiveness.
* Ensure that all identified vulnerabilities have remediation plans scheduled.
Operational Resilience Adherence:
* Govern and ensure all critical applications adhere to operational resilience testing and recovery requirements.
* Collaborate with stakeholders to define and maintain appropriate impact tolerances for critical business services.
* Ensure adherence to regulatory requirements for operational resilience, including MAS, OCC, and other jurisdictional mandates.
Performance Measurement and Reporting:
* Monitor and report on key resilience metrics, including the number of applications executing production swing tests, the number on One‑Touch Recovery, recovery times and adherence to operational resilience requirements.
* Provide regular updates to senior management on the status of resilience initiatives and key performance indicators.
* Drive the development of resiliency dashboards and self‑service reporting capabilities to provide transparency into program progress and application resiliency posture.
Key Qualifications:
* Experience in software engineering, site reliability engineering (SRE), or technology risk and controls.
* Experience in a program or project management role delivering complex, cross‑functional technology initiatives.
* Proven expertise in analyzing complex application, database, network, and OS issues across distributed, large‑scale, customer‑facing systems.
* Strong understanding of resiliency principles, including disaster recovery, data recovery, and high‑availability architecture.
* Excellent communication skills and a proven ability to work effectively across multiple business and technical teams.
* Bachelor’s degree in Computer Science, Engineering, or an equivalent field.
What we’ll provide you:
By joining Citi, you will not only be part of a business casual workplace with a hybrid working model (up to 2 days working at home per week), but also receive a competitive base salary (annually reviewed) and enjoy a host of additional benefits such as:
* 27 days annual leave (plus bank holidays)
* A discretionary annual performance‑related bonus
* Private Medical Care & Life Insurance
* Employee Assistance Program
* Pension Plan
* Paid Parental Leave
* Special discounts for employees, family, and friends
* Access to an array of learning and development resources
Alongside these benefits, Citi is committed to ensuring our workplace is a place where everyone feels comfortable coming to work as their whole self every day. We want the best talent around the world to be energized, motivated to stay, and empowered to thrive.
Citi is an equal‑opportunity employer, and qualified candidates will receive consideration without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, protected veteran status, or any other characteristic protected by law.
If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity, please review Accessibility at Citi.
View Citi’s EEO Policy Statement and the Know Your Rights poster.
#J-18808-Ljbffr