hero

Find Your Dream Job Today

Senior Researcher - Efficiency for Large Language Models

Microsoft

Microsoft

Software Engineering, Data Science
Posted on Sep 15, 2025

Senior Researcher - Efficiency for Large Language Models

Cambridge, Cambridgeshire, United Kingdom

Save

Share job

Date posted
Sep 15, 2025
Job number
1876594
Work site
3 days / week in-office
Travel
0-25 %
Role type
Individual Contributor
Profession
Research, Applied, & Data Sciences
Discipline
Research Sciences
Employment type
Full-Time

Overview

Generative AI is transforming how people create, collaborate, and communicate - redefining productivity across Microsoft 365 and our customers globally. At Microsoft, we run the biggest platform for collaboration and productivity in the world with hundreds of millions of consumer/enterprise users. Tackling AI efficiency challenges is crucial for delivering these experiences at scale.


Within our Microsoft wide Systems Innovation initiative, we are working to advance efficiency across AI systems, where we look at novel designs and optimizations across AI stacks: models, AI frameworks, cloud infrastructure, and hardware. We are an Applied Research team driving mid- and long-term product innovations. We closely collaborate with multiple research teams and product groups across the globe who bring a multitude of technical expertise in cloud systems, machine learning and software engineering. We communicate our research both internally and externally through academic publications, open-source releases, blog posts, patents, and industry conferences. Further, we also collaborate with academic and industry partners to advance the state of the art and target material product impact that will affect 100s of millions of customers.

We are looking for a Senior Researcher - Efficiency for Large Language Models to explore model/system-level optimizations to deliver significant efficiency gains for Large Language Models and Generative AI experiences. The ideal candidate will have strong knowledge of state-of-the-art and emerging Large Language Models, LLM architectures & optimizations, as well as hands-on experience in LLM frameworks and evaluation. We are seeking someone with an interest to work at the intersection of research and product with the ambition to apply this research into a real-world setting.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond. Have a look at this link for reading: https://www.microsoft.com/en-us/research/group/systems-innovation/

Qualifications

Required Qualifications:

  • Doctorate in Computer Science, Machine Learning, Statistics, Engineering, Mathematics, Physics, or related field
    • OR equivalent experience.
  • Research experience and publications in top conferences/journals (NeurIPS, ICML, ICLR, AISTATS, ACL, EMNLP, NAACL, ISCA, MICRO, ASPLOS, HPCA, SOSP, OSDI, NSDI, etc.) in at least one of the following areas: natural language processing, statistics, machine learning, and optimization.
  • Solid knowledge of state-of-the-art and emerging Large Language Models (LLMs), including their application in complex systems.
  • Solid coding and engineering skills to design experiments and help to drive research into product.

Other Requirements:


Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings:

  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Preferred Qualifications:

  • Doctorate in Statistics, Computer Science, Engineering, Mathematics, Physics, or related field AND 2+ years related experience (e.g., statistics predictive analytics, research)
    • OR equivalent experience.
  • Hands on experience in improving the design and efficiency of generative AI systems and related frameworks and toolkits
  • Familiarity with LLMs such as the OpenAI GPT models, LLaMa etc., model fine-tuning techniques (LoRa, QLoRa), prompting techniques (Chain of Thought, ReACT etc.).
  • Ability to work independently and in a team, take initiative and lead engagements as required.

#M365Core #M365Research #Research

Responsibilities

  • Conduct novel research to advance the state-of-the-art in efficiency for Large Language Model / Generative AI experiences to enable their deployment at scale.
  • Work with a small group of fellow research scientists and product engineering teams to execute practical solutions for real-world impact.
  • Drive the end-to-end research agenda from establishing the problem definition to building algorithms and models.
  • Publish and contribute to top scientific conferences and journals.

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
Industry leading healthcare
Educational resources
Discounts on products and services
Savings and investments
Maternity and paternity leave
Generous time away
Giving programs
Opportunities to network and connect

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.