Overview
Associate Director of Engineering – Emerging Technology and Innovation (ETIV) - AI role at HSBC. Location: Sheffield, GB. Work style: Hybrid.
Role and Responsibilities
In this role, you will join a growing team to work with a wide range of engineers, product managers, and production support specialists supporting our Group AI offerings (e.g., speech transcription, translation, knowledge management) for scaled consumption across our Group Business and Functions. You will be a senior leader within ETIV, acting as a regional Production Support Lead to provide consultancy, practice and guidance to application teams to ensure the stability, reliability, and performance of our production systems and critical applications.
* Production Support: 24x7 production support to attend system alerts, recovery, and operational tasks to ensure system reliability and availability; participate in rotating on-call support duties; manage escalations and proactively prevent service outages.
* Incident Management and Problem Management: Oversee incident response and drive timely resolution and recovery to minimise service degradation/outage time. Conduct post-incident reviews for root cause analysis and improvement actions.
* Change Management: Drive proper change management and approval processes for all platform changes, ensuring proper planning and execution.
* Automation, Monitoring and Visualization: Drive automation of operational tasks, set up monitoring to detect issues early, and create visualizations (e.g., dashboards) to understand system health in real time.
* Capacity Management: Collaborate with the platform team to ensure the platform is fit for future demand.
* Best Practices and Collaboration: Establish reliability best practices and collaborate with platform and capability engineering teams to implement them in development and operations.
* Security and Vulnerability: Ensure the platform and systems comply with security controls and patching requirements.
* Continuous Improvement: Identify improvement opportunities and drive their implementation with proper reporting to maintain site reliability and availability.
* SRE: Lead adoption of SRE principles, including SLIs, SLOs, and error budgets to improve service reliability and performance.
* Collaboration: Work with engineering, QA, and product teams to incorporate operational requirements into the application lifecycle and promote reliability-focused engineering practices.
* Leadership: Mentor and guide junior SRE and production support team members, fostering ownership, continuous learning, and excellence.
Technical Qualifications
Technical
* DevOps
* Containerization (Docker, Kubernetes)
* Designing and operating scalable, secure, and highly available platforms (e.g., Kubernetes, GKE, EKS, or OpenShift)
* Container orchestration and CI/CD
* AI / GenAI platforms and workloads, including ML pipelines, model serving/inference, GPU/accelerator resource management
* Java, Python, Go, or Bash
* Configuration management (e.g., Terraform, Ansible) to enable infrastructure as code and automation
* Reliability and performance improvements, including incident management, problem management, change planning and control, and capacity management
* Translation of strategies & plans to achieve business and functional goals
* Senior stakeholder management
* Relationship management
Behavioural Skills
* Customer Oriented
* Outcome Oriented
* Problem Solver
* Team management
Cognitive Skills
* Divided attention
* Quantitative
* Critical thinking
* Collaboration
* Logic and reasoning
This role is based in Sheffield on a hybrid basis.
HSBC is committed to creating diverse and inclusive workplaces. We are an equal opportunity employer and consider applicants for all positions without regard to race, color, religion, sex, national origin, age, disability, or any other legally protected status. If you need accommodations during the recruitment process, please contact our Recruitment Helpdesk.
Details
* Seniority level: Director
* Employment type: Full-time
* Job function: Engineering and Information Technology
#J-18808-Ljbffr