hero

Find Your Dream Job Today

Out for Undergrad
companies
Jobs

Reliability Engineer: Internship Opportunities

Microsoft

Microsoft

Taipei City, Taiwan
Posted on Jan 21, 2026
Overview

Come build community, explore your passions and do your best work at Microsoft with thousands of University interns from every corner of the world. This opportunity will allow you to bring your aspirations, talent, potential—and excitement for the journey ahead.

About the team and the role:

Azure Hardware Systems and Infrastructure (AHSI) is a diverse organization within Azure comprising more than 3,500 people across the United States in the Puget Sound, Silicon Valley, Hillsboro, Oregon and Raleigh, North Carolina, and internationally across Europe, Asia, and Australia. Around the world our teams invent, build, and deliver hardware infrastructure and global capacity to fuel the intelligent cloud and the intelligent edge on Azure. We are Mission First, People Always. Our Mission: Powering the world's computer with the most advanced hardware systems & infrastructure.

This Reliability Engineer Internship provides hands‑on exposure to AI hardware products used in large‑scale data center environments. The intern will support AI Fleet Hardware Quality in data analysis, issue investigation, and quality improvement activities for AI systems, while learning real‑world AI hardware quality, reliability, and fleet operations practices. The role emphasizes learning, problem‑solving, and responsible use of AI tools to accelerate analysis and coding skills.

Internship Period: 12 weeks in summer of 2026 (May.~Sep.)

Target candidates: University students from related major who will graduate between Sep 2026 and Aug 2027

At Microsoft, Interns work on real-world projects in collaboration with teams across the world, while having fun along the way. You’ll be empowered to build community, explore your passions and achieve your goals. This is your chance to bring your solutions and ideas to life while working on cutting-edge technology.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.



Responsibilities
  • Support AI Fleet Hardware Quality in analyzing quality and reliability issues for AI hardware products (e.g., GPU servers, AI racks, Liquid Cooling Solutions).
  • Assist in collecting, organizing, and analyzing quality data such as logs, telemetry, and issue records.
  • Learn and support root cause analysis (RCA) by documenting failure symptoms and observations.
  • Use AI‑assisted tools (e.g., Microsoft Copilot, GitHub Copilot, etc.) for learning, research, and skill development, and to assist with documentation, coding practice, or data analysis under guidance.
  • Help track issues, action items, and cross‑team coordination.
  • Assist in preparing concise quality summaries, dashboards, or reports.
  • Learn AI fleet quality processes, tools, and best practices used in data center environments.


Qualifications

Required Qualifications:

  • Currently pursuing a Bachelor’s or Masters’s Degree in Electrical Engineering, Mechanical Engineering, Materials Engineering, Reliability Engineering, System Engineering, or related field.
  • Must have at least one additional quarter/semester of school remaining following the completion of the internship.

Preferred Qualifications

  • Experience with coding or scripting (e.g., Python, C/C++, PowerShell).
  • Familiarity using AI tools for study and productivity, including Copilot‑based tools and public AI platforms (e.g., ChatGPT, Gemini) for learning purposes.
  • Strong logical thinking, problem‑solving, and analytical skills. Strong willingness to learn, with good communication and documentation skills.
  • Good organization and logistics skills (issue tracking, task management).
  • Interest in AI hardware, GPU systems, reliability, or data center operations.

This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.




Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.