Responsibilities:
1. Architectural Leadership and Strategy:Shape the design, architecture, and strategic evolution of Elanco’s HPC, storage, and networking infrastructure to meet future research demands.
2. Technology Road Mapping and Innovation: Evaluate emerging technologies, conduct proof-of-concept projects, and build business cases for new investments to keep Elanco at the cutting edge of scientific computing.
3. Mentorship and Technical Guidance: Act as a senior mentor and technical escalation point for other engineers and support staff, fostering technical excellence and knowledge sharing within the team.
4. HPC System Management: Design, deploy, configure, and maintain Elanco’s HPC clusters and associated storage and networking infrastructure.
5. Advanced Performance Optimisation: Proactively monitor system performance, troubleshoot bottlenecks, and tune the environment to ensure optimal efficiency and resource utilization.
6. User Support and Enablement: Act as the primary technical contact for our research and scientific user base, providing support, training, and guidance on how to best leverage HPC resources.
7. Automation and Tooling: Develop and maintain scripts and automation tools to streamline system administration, job scheduling, and monitoring tasks.
8. Job Scheduler Management: Manage and configure job scheduling systems to ensure fair and efficient allocation of computational resources.
9. Security and Compliance: Implement and maintain security best practices to protect sensitive data and ensure the integrity of the HPC environment.
10. Capacity Planning: Collaborate with stakeholders to forecast future computing needs and contribute to the strategic planning and evolution of Elanco’s HPC capabilities.
What You Need to Succeed (minimum qualifications):
11. Educational Background: A Bachelor's or Master's degree in Computer Science, Engineering, or a related technical field.
12. Strategic Thinking and Business Acumen: Ability to align technical strategy with business goals, develop multi-year roadmaps, and justify major technology investments.
13. System Administration:Deepexpertise in Linux/Unix system administration in a large-scale environment.
14. HPC Technologies:Broadexperience with HPC cluster management, including job schedulers and parallel file systems.
15. Scripting Proficiency:Exceptional scripting skills for automation, particularly in Python and Bash.
16. Networking Knowledge: Solid understanding of high-speed networking fabrics like InfiniBand or Omni-Path.
17. Cloud and Hardware Acumen: Familiarity with Public Cloud services, specifically Microsoft Azure and Google Cloud Platform (GCP), as well as server, storage, and networking hardware components common in HPC environments.
18. DevSecOps: Proven experience with relevant DevSecOps concepts and tooling, including Continuous Integration/Continuous Delivery (CI/CD), Git SCM, Containerisation (Docker, Kubernetes), Infrastructure-as-Code (HashiCorp Terraform).
19. Problem-Solving: Excellent analytical and troubleshooting skills, with the ability to diagnose and resolve complex technical issues efficiently.
20. Communication Skills: Strong interpersonal and communication skills, with a customer-centric approach to supporting a diverse scientific user community.
21. Leadership and Mentoring: Proven experience leading complex technical projects and mentoring junior and senior engineers.
Additional Information:
22. Travel:0-10%
23. Location: Hook, UK - Hybrid Work Environment
Don’t meet every single requirement? Studies have shown underrepresented groups are less likely to apply to jobs unless they meet every single qualification. At Elanco we are dedicated to building a diverse and inclusive work environment.If you think you might be a good fit for a role but don't necessarily meet every requirement, we encourage you to apply.You may be the right candidate for this role or other roles!