Join to apply for the Serverless LLM Architect role at Huawei Technologies Research & Development (UK) Ltd
About Huawei Research And Development UK Limited
Huawei is a leading global provider of information and communications technology (ICT) infrastructure and smart devices. We have 207,000 employees and operate in over 170 countries and regions, serving more than three billion people around the world.
Our vision and mission is to bring digital to every person, home and organization for a fully connected, intelligent world. To this end, we will drive ubiquitous connectivity and promote equal access to networks; bring cloud and artificial intelligence to all four corners of the earth to provide superior computing power where you need it, when you need it; build digital platforms to help all industries and organizations become more agile, efficient, and dynamic; redefine user experience with AI, making it more personalized for people in all aspects of their life, whether they’re at home, in the office, or on the go.
This spirit of innovation has led Huawei to work in close partnership with leading academic institutions in the UK to develop and refine the latest technologies. With a shared commitment to innovation and progress, both parties have worked together to achieve common goals and establish a strong partnership.
Huawei’s vision is a fully connected, intelligent world. To achieve this, we work to inspire passion for basic research around the world. Our combined passion drives development across the global innovation value chain. Huawei has the largest Research and Development organization in the world with 96,000+ employees in research centers around the globe.
Huawei Research And Development UK Limited Overview
We continue to explore and define new research directions and new services. We have expanded our collaborations with academic researchers; researched new network architectures, integration of communications and key enabling technologies; and developed the fundamental theories of these technologies.
Job Summary
As a pioneer in global technological innovation, Huawei is committed to advancing the development of information technologies and has made remarkable achievements in server and device services, showcasing its strong technological innovation and market reach.
Joining the Huawei Serverless LLM team, you will be in cutting-edge fields such as AI infrastructure, data systems, artificial intelligence, and cloud computing. You will work side by side with global expert teams to meet hundreds of millions of service requirements.
Key Responsibilities:
* Use serverless methods to ensure excellent performance of the LLM service in high-concurrency scenarios, optimize the response speed and resource consumption of the LLM service, and achieve high throughput and low latency in inference.
* Explore the next-generation distributed inference engine to ensure high reliability, scalability, and O&M convenience of the system and support large-scale LLM commercial use in the future.
* Track the latest LLM optimization technology to ensure model performance while effectively reducing computing costs, improving loading efficiency, and achieving ultimate system throughput.
* Identify and define future-oriented technical challenges in the serverless LLM field, and enhance technical communication and cooperation with European academia.
* Work closely with cross-functional teams to participate in the innovation of AI infrastructure, data systems, and cloud computing technologies, and promote the commercial application and implementation of Huawei's serverless LLM architecture.
Person Specification:
Required:
* Understand the principles and architecture design of LLMs. Have strong experience in LLM optimization and servitization, including technologies for reducing resource consumption and response delay.
* Have a basic command of the distributed system framework and serverless architecture. Have a good command of the core concepts of distributed computing.
* Have experience in designing and optimizing large-scale distributed cluster systems. Have a basic command of common serverless technologies such as on-demand invoking, automatic expansion, and load prediction and balancing.
* Innovation and technical breakthrough: Be able to independently solve complex technical problems, have the spirit of team leadership and collaboration, be bold in taking responsibilities, and be able to work closely with cross-functional teams to promote the application and commercialization of serverless LLM technology.
Desired:
* Experience in LLM algorithm optimization is preferred.
* Papers or project achievements related to cutting-edge serverless technologies, and experience in publishing at AI or cloud computing conferences is preferred.
* Familiar with bottom-layer architectures such as distributed systems and OSs is preferred.
What We Offer
* 33 days annual leave entitlement per year (including UK public holidays)
* Group Personal Pension
* Life insurance
* Private medical insurance
* Medical expense claim scheme
* Employee Assistance Program
* Cycle to work scheme
* Company sports club and social events
* Additional time off for learning and development
#J-18808-Ljbffr