Senior Compute Library Engineer - Kernel / CUDA / OpenCL / GPU / NPU

I am partnered with an incredibly exciting start-up working on AI accelerators and RISC-V technologies. They are looking to bring on a Compute Library Engineer to develop high-performance kernels for machine learning operators on NPU architectures.

They are an extremely high-caliber team that has historically hired from the likes of Apple, Intel and BSC, and they are looking to expand their site in Cambridge, which currently has around 20 people.

A bit about the role:
- High-performance kernel development for ML operators on NPU architectures
- Kernel optimization
- Integration of kernels into the NPU framework
- Utilization of GPU and accelerator hardware features that are specialized for AI applications

A bit about you:
- 5+ years of experience in kernel development for GPUs or NPUs
- Good experience with parallel programming languages such as CUDA or OpenCL
- Strong software development skills
- Knowledge of ML frameworks
- Familiarity with hardware architecture and a system-level understanding of NPUs or GPUs

By applying to this role you understand that we may collect your personal data and store and process it on our systems. For more information please see our Privacy Notice (https://eu-recruit.com/about-us/privacy-notice/)