Senior Researcher - Efficiency for Large Language Models

Cambridge

Microsoft

Posted: 15 September

Offer description

Overview

Generative AI is transforming how people create, collaborate, and communicate - redefining productivity across Microsoft 365 and our customers globally. At Microsoft, we run the biggest platform for collaboration and productivity in the world with hundreds of millions of consumer/enterprise users. Tackling AI efficiency challenges is crucial for delivering these experiences at scale.

Within our Microsoft-wide Systems Innovation initiative, we are working to advance efficiency across AI systems, where we look at novel designs and optimizations across AI stacks: models, AI frameworks, cloud infrastructure, and hardware. We are an Applied Research team driving mid- and long-term product innovations. We collaborate closely with multiple research teams and product groups across the globe who bring a multitude of technical expertise in cloud systems, machine learning, and software engineering. We communicate our research both internally and externally through academic publications, open-source releases, blog posts, patents, and industry conferences. We also collaborate with academic and industry partners to advance the state of the art and target material product impact that will affect hundreds of millions of customers.

We are looking for a Senior Researcher - Efficiency for Large Language Models to explore model- and system-level optimizations that deliver significant efficiency gains for Large Language Models and Generative AI experiences. The ideal candidate will have strong knowledge of state-of-the-art and emerging Large Language Models, LLM architectures and optimizations, as well as hands-on experience in LLM frameworks and evaluation. We are seeking someone with an interest in working at the intersection of research and product, and the ambition to apply this research in real-world settings.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Qualifications

Required Qualifications:

1. Doctorate in Computer Science, Machine Learning, Statistics, Engineering, Mathematics, Physics, or related field OR equivalent experience.
2. Research experience and publications in top conferences/journals (NeurIPS, ICML, ICLR, AISTATS, ACL, EMNLP, NAACL, ISCA, MICRO, ASPLOS, HPCA, SOSP, OSDI, NSDI, etc.) in at least one of the following areas: natural language processing, statistics, machine learning, and optimization.
3. Solid knowledge of state-of-the-art and emerging Large Language Models (LLMs), including their application in complex systems.
4. Solid coding and engineering skills to design experiments and help to drive research into product.

Other Requirements:

The ability to meet Microsoft, customer, and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screenings:

5. Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Preferred Qualifications:

6. Doctorate in Statistics, Computer Science, Engineering, Mathematics, Physics, or related field AND 2+ years related experience (e.g., statistics, predictive analytics, research) OR equivalent experience.
7. Hands-on experience in improving the design and efficiency of generative AI systems and related frameworks and toolkits.
8. Familiarity with LLMs such as the OpenAI GPT models and LLaMA, model fine-tuning techniques (LoRA, QLoRA), and prompting techniques (Chain of Thought, ReAct, etc.).
9. Ability to work independently and in a team, take initiative and lead engagements as required.

#M365Core #M365Research #Research

Responsibilities

10. Conduct novel research to advance the state-of-the-art in efficiency for Large Language Model / Generative AI experiences to enable their deployment at scale.
11. Work with a small group of fellow research scientists and product engineering teams to execute practical solutions for real-world impact.
12. Drive the end-to-end research agenda from establishing the problem definition to building algorithms and models.
13. Publish and contribute to top scientific conferences and journals.

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.

Industry leading healthcare
Educational resources
Discounts on products and services
Savings and investments
Maternity and paternity leave
Generous time away
Giving programs
Opportunities to network and connect
