We are seeking an HPC Engineer to deploy, operate, troubleshoot, and improve high‑bandwidth GPU interconnect platforms across our global data center footprint.
What You Will Do
* Deploy, operate, and support NVLink/NVSwitch platforms across large data center environments.
* Troubleshoot Linux, networking, hardware, firmware, performance, and stability issues in production.
* Build automation and improve runbooks, dashboards, alerts, and lifecycle workflows.
* Collaborate with teams across CoreWeave, with external vendors, and with customer-facing stakeholders.
* Drive assigned work to completion with clear communication, thoughtful prioritization, and early visibility into risks or blockers.
* Participate in on‑call, incident response, root cause analysis, and follow‑up improvements.
* Contribute to reliable workflows that scale across regions, platforms, and fleet growth, with ownership calibrated by level.
What We Are Looking For
* Strong Linux system administration and troubleshooting skills.
* Knowledge of networking fundamentals and common troubleshooting tools.
* Production debugging experience using logs, metrics, and command‑line tools.
* Server, network, GPU, or data center hardware troubleshooting experience.
* Practical scripting or automation experience in Python, Go, Bash, or similar.
* Clear communication, documentation, collaboration, and on‑call readiness.
* Curiosity to learn specialized GPU interconnect technologies such as NVLink, NVSwitch, and InfiniBand.
Preferred Qualifications
* Ansible or other infrastructure automation tooling.
* Kubernetes application development or operations experience.
* Grafana, Prometheus, PromQL, or similar observability systems.
* Large fleet operations across Linux systems, network devices, GPUs, or infrastructure components.
* InfiniBand, RDMA, HPC networking, or low‑latency/high‑bandwidth fabrics.
* BMC, Redfish, IPMI, firmware lifecycle management, or hardware management APIs.
* NVLink, NVSwitch, NVIDIA GPU platforms, NVUE, SONiC, or network operating systems.
What We Offer
* Competitive salary: £79,000 to £105,000.
* Discretionary bonus and equity awards.
* Family‑level Medical and Dental Insurance.
* Generous Pension Contribution.
* Life Assurance at 4x Salary.
* Critical Illness Cover.
* Employee Assistance Programme.
* Tuition Reimbursement.
* Work culture focused on innovative disruption.
Equal Opportunity
CoreWeave is an equal opportunity employer, committed to fostering an inclusive and supportive workplace. All qualified applicants and candidates will receive consideration for employment without regard to race, color, religion, sex, disability, age, sexual orientation, gender identity, national origin, veteran status, or genetic information.
Export Control Compliance
This position requires access to export-controlled information. To conform to U.S. Government export regulations applicable to that information, an applicant must be one of the following:
* A U.S. person, defined as a U.S. citizen or national, lawful permanent resident (green card holder), refugee, or asylee.
* Eligible to access the export-controlled information without a required export authorization.
* Eligible and reasonably likely to obtain the required export authorization from the applicable U.S. government agency.
CoreWeave may, for legitimate business reasons, decline to pursue any export licensing process.