Senior Software Engineer - Cloud Capacity Management Automation System
Microsoft
Azure is Microsoft’s central cloud infrastructure hosting both public cloud offerings and a large number of Microsoft internal cloud‑scale services. Cloud computing is a highly competitive and rapidly growing market, and Azure aims to be an industry leader across all relevant aspects of its platform and services. Within Azure, the Azure Compute team is the core infrastructure organization responsible for hosting virtual machines (VMs), containers, and other workloads that support customers and internal services at global scale.
A fundamental discipline in cloud computing is capacity management, which must ensure sufficient capacity across all regions, allocation domains, and hardware infrastructure to meet customer demand while also provisioning efficiently to avoid overspending and negative cost impacts related to cost of goods sold (COGS) and capital expenditure (CAPEX). At the scale of Azure, managing this balance across the entire Azure Compute fleet is a highly complex challenge, where improvements can prevent customer allocation failures while also enabling significant efficiency and cost savings.
The Azure Compute Capacity and Efficiency (AC2E) team is responsible for managing all aspects of capacity and efficiency across the Azure Compute fleet. The team’s primary responsibility is to deliver a fully automated and highly optimized tracking and management system, including the Capacity Management Automation System (CMAS), which uses state‑of‑the‑art algorithms and artificial intelligence (AI) to predict capacity risk and execute mitigation actions directly within the Azure Compute platform. Team members work closely with engineers, program managers, data scientists, and capacity planning partners to define business problems and drive solutions end to end from design through production, while also contributing to strategic decision making for features that impact capacity and efficiency. The impact of this work is reflected in improvements to the Azure platform, service capacity fulfillment rates, customer satisfaction, and efficiency metrics such as COGS reduction, supported by applied data science techniques including anomaly detection, machine learning, and experimentation methodologies.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees, we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Responsibilities
- Promote cross-group collaboration, working across boundaries and building partnerships with stakeholder teams.
- Deliver technical solutions with an appropriate balance of risk taking and prioritization.
- Lead by example, engage in discussions on the architecture of products or solutions, and apply engineering principles to establish robust and maintainable architecture and designs.
- Drive engineering improvements and rigor within the team as well as with stakeholders.
- Understand platform capacity constraints and work with teams across Azure to improve capacity manageability and efficiency.
- Drive improvements to the product design and architecture, leading to increased customer satisfaction.
- Lead and collaborate with experts from across the company to advance capacity management, capacity planning, and efficiency.
Qualifications
Required Qualifications
- Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
Other Requirements
- Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screenings:
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.
Preferred Qualifications
- Master's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR Bachelor's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.