The University of Sheffield provides an innovative environment for IT research and innovation. This role focuses on designing, deploying, and operating high-performance computing platforms across diverse research domains.
Overview
As Research Platforms Engineering Lead, you will manage a small team of technical specialists and oversee the development and implementation of scalable, cost‑effective research IT platforms. You will lead strategy and delivery for the University's high‑performance computing (HPC) systems, cloud services, and data storage solutions, ensuring secure, high‑availability services for researchers.
Main Duties And Responsibilities
* Lead, manage and coach the Research Platforms Engineering team.
* Design and develop scalable research IT platforms that meet the diverse needs of university research groups.
* Provide and maintain compute platforms capable of running a wide spectrum of research applications, capable of scaling in capacity and capability.
* Ensure high‑performance storage that enables I/O‑intensive compute tasks and secure storage of research data.
* Develop and refine cloud services providing scalable, secure, and innovative research IT solutions.
* Implement and execute the Research and Innovation IT platform strategy, creating new platforms to support the university’s research requirements.
* Provide technical guidance to infrastructure teams, ensuring the necessary resources support the research IT strategy and roadmap.
* Collaborate with the Research Computing Support team to offer technical support and advice to research groups, facilitate migration from legacy clusters to modern solutions, and advise on optimal compute environments.
* Configure scheduling software, implement policies for resource allocation, and support research groups in purchasing dedicated access to computing resources.
* Design solutions that lower entry barriers for users with limited HPC experience.
* Ensure integration of research platform provision with core infrastructure, delivering operational support and strategic staffing for research computing platforms.
* Collaborate with colleagues to inform the direction of the R&I product area and contribute to broader strategic initiatives.
* Perform additional duties consistent with the role’s grade and remit.
Person Specification
The University values diversity and encourages all candidates who meet the essential criteria to apply. Please reference the application criteria in your statement.
Criteria
Essential Or Desirable
Stage(s) assessed at
* Significant experience of designing, running, and supporting research platforms and infrastructure, including HPC clusters and public/private cloud systems.
* Deep knowledge and experience managing HPC and/or public cloud (AWS) technologies, including job schedulers (e.g., Slurm) and containerisation (e.g., Kubernetes).
* Knowledge and experience of running multi‑user systems and managing services for demanding users.
* Proven ability to manage teams to deliver reliable and robust compute platforms and develop new compute services.
* Experience of planning and managing routine and project activities, team members, and budgets with complex drivers and dependencies.
* Ability to work with budgets, predicting and projecting funding, skills, and team size needed to achieve strategic goals.
* Understanding of information and cyber security on high‑performance platforms and cloud infrastructure; ISO27001 exposure is an advantage.
* Ability to develop effective relationships with external suppliers of hardware, software, and related services.
* Experience of academic research processes and activities; a PhD or equivalent research background is desirable.
Criminal record
A basic DBS check may be required for this role. Possession of a criminal record is not an automatic bar to employment; each case is evaluated individually.
We are a Disability Confident Employer
If you have a disability and meet the essential criteria for this job you will be invited to take part in the next stage of the selection process.
#J-18808-Ljbffr