Job title:
Senior Platform Operation Manager – VP
Company
Morgan Stanley
Job description
This position is for a Senior Platform Operation Manager of Snowflake/Gen AI/Postgres CET based in Glasgow, responsible for managing and improving the global database infrastructure services with Site Reliability Engineering (SRE) oversight.
Position Description:
* The Snowflake/Postgres Customer Engagement Team (CET) is part of Morgan Stanley's Enterprise Computing Data Services Organization, managing mission-critical distributed database platforms like Snowflake, Postgres, and Greenplum on cloud and on-premises.
* The successful candidate will serve as incident and escalation manager for the global production Data and Analytics infrastructure during EMEA hours.
* Lead projects such as data center migration, version upgrades, release management, automation, database design, and performance optimization.
* Participate in at least one squad as SRE, following Agile practices and contributing to infrastructure modernization.
* Require 10+ years of enterprise IT experience and expertise in distributed database platforms, scripting, Linux/Unix, monitoring tools, and project management.
* Must have strong communication, organizational, incident management skills, and be available for weekend work.
Key Responsibilities:
* Deploy, optimize, and manage enterprise-scale distributed database platforms.
* Respond to incidents, troubleshoot, and perform root cause analysis.
* Design and maintain disaster recovery and high-availability solutions.
* Automate operational tasks related to provisioning, monitoring, backups, and recovery.
* Monitor system health, optimize performance, and collaborate on schema and query optimization.
* Ensure data security and compliance, participate in on-call rotations.
Database Operations:
* Manage database deployment, upgrades, backups, and schema in production.
* Monitor performance and troubleshoot issues in distributed/OLTP/OLAP environments.
Infrastructure & Automation:
* Experience with cloud platforms (AWS, Azure), Infrastructure as Code tools, and scripting for automation.
Observability:
* Set up and use monitoring, logging, and alerting tools; understand SLIs, SLOs, SLAs.
High Availability & Disaster Recovery:
* Design and implement HA/DR solutions, run recovery drills.
Operational Skills:
* Incident response, root cause analysis, capacity planning, change management.
Bonus Skills:
* Experience with container orchestration, CI/CD, regulatory compliance, strategic problem-solving, modern data architectures, documentation skills.
What you can expect from Morgan Stanley:
We support a culture of excellence, diversity, and inclusion, offering attractive benefits and career growth opportunities. For more info, visit our global offices.
Note: This role may require regulatory qualifications. Morgan Stanley promotes flexible working arrangements and is an equal opportunities employer.
#J-18808-Ljbffr