Senior Linux HPC Systems Administrator / Engineer
Location: Stevenage, Hertfordshire (hybrid/onsite)
Onsite: 3 days/week, plus availability to attend site at short notice for hands-on hardware support
My client is looking for a Senior Linux HPC SysAdmin/Engineer (minimum 10 years’ enterprise IT experience) to support a Linux-based high-performance/scientific computing environment. You’ll provide hands-on onsite support to technical/scientific users, maintaining critical infrastructure and high-end workstations.
Key responsibilities
* Administer, configure and support Red Hat Enterprise Linux (RHEL 8/9) environments (stability, performance, security)
* Support high-end workstations and resolve hardware/software issues onsite
* Support HPC environments including clustering and workload management (Slurm)
* Monitor and troubleshoot performance issues (including GPU- and network-related issues)
* Use ServiceNow for incident/change/ticket management and drive process improvements
* Manage SSL certificates and assist with web server configuration where required
* Work closely with stakeholders and vendors; communicate technical topics clearly and build strong working relationships
Required skills / experience
* 10+ years' enterprise IT experience, with strong hands-on RHEL (8 & 9) administration
* Experience supporting scientific users/applications and/or research computing environments
* HPC exposure: Slurm, clusters, and general HPC operations/support
* Strong troubleshooting across Linux, hardware and applications
* Confident stakeholder communication (onsite support is a key part of the role)
Nice to have
* ServiceNow experience
* Broader knowledge of networking, performance monitoring tooling, and GPU technologies