Overview
You: As a Manager III on the AWS Neuron team, you’ll lead a team of compiler engineers through developing, deploying, and scaling a compiler targeting AWS Inferentia and Trainium. You’ll be a technically capable, credible, and curious partner to AWS ML services teams, participating in pre-silicon design, bringing new products, optimizations, and features to market, and ensuring the Neuron SDK meets performance, cost, and usability expectations.
Key context: The AWS Neuron software stack includes an ML compiler, a runtime, and tight integration with popular ML frameworks (PyTorch, TensorFlow, MXNet). The team operates within Annapurna Labs, focusing on silicon and software innovation for AWS customers.
Relocation: To be considered, candidates must be currently located in, or willing to relocate to, Toronto.
Responsibilities
* Lead a team of compiler engineers to develop, deploy, and scale a compiler for AWS Inferentia and Trainium.
* Partner with AWS ML services teams; communicate technically and strategically as a hands-on manager.
* Contribute to pre-silicon design and drive new product features, optimizations, and improvements to the Neuron SDK.
* Apply deep knowledge of resource management, scheduling, code generation, optimization, and instruction architectures (CPU, NPU, GPU, and novel compute forms).
* Collaborate across teams to ensure the Neuron SDK delivers high performance, low cost, and ease of use for customers.
Basic Qualifications
* 3+ years of engineering team management experience
* 6+ years of experience working directly within engineering teams
* 4+ years designing or architecting systems (design patterns, reliability, scaling)
* Experience partnering with product or program management teams
* Excellent software design fundamentals, strong knowledge of software engineering principles, and deep understanding of compilers (resource management, instruction scheduling, code generation, compute graph optimization)
Preferred Qualifications
* M.S. or Ph.D. in Computer Science or related technical field
* Experience with toolchains (LLVM, GCC) and code generation techniques for new hardware
* Knowledge of compiler internals from front end to runtime, with emphasis on AI acceleration
Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability, or other legally protected status. If you require a workplace accommodation during the application or hiring process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.
Posted: September 25, 2025