Principal Software Engineer
Microsoft
#MSC #IDC #MSCIDC
Are you looking for an opportunity to work with the latest Azure offerings and push the limits of cloud computing? Are you inspired to push the boundaries of how intelligent systems transform cloud experiences? Do you want to solve real-life challenges in intelligent cloud and enable customers to achieve more? If so, we have the perfect role for you!
We are part of the Microsoft Specialized Cloud organization, delivering Azure to customers on their premises. Our team is innovating new technologies for managing Edge Devices from Azure in a safe, secure, sovereign, reliable, and scalable model for our global customers across various Azure Geographies. We pride ourselves in developing technologies that enhance customer experience and enable using edge computing to address customer workload needs and managing edge devices in Azure. We build ecosystems that bring intelligence to locations where customers are running their business. We enable real-time decision-making, ultra-low-latency experiences, and trustworthy AI systems that operate securely at the edge and in the cloud. If you’re passionate about AI-driven engineering — from intent understanding and data-driven insights to autonomous orchestration and predictive edge intelligence — this is where your work will have global impact.
As part of this role in the Microsoft Specialized Cloud team, you operate as a senior technical leader shaping the data & analytics foundations that power this org’s products and platforms at a global scale. The engineer will collaborate with Product, Engineering, and Data Science stakeholders from multiple teams to analyse data requirements, understand the data flow across existing and new products, differentiate human vs. AI Agent generated data, lead the design of data models, run the product with an AI first mindset. You will be responsible for identifying data sources, building code to extract raw data, and transforming it into formats compatible with downstream consumers. You will develop tools, do performance monitoring, and perform root cause analysis of issues to ensure robust and reliable data pipelines. You will define architectural direction, influence multi-team roadmaps, and set data engineering standards that enable reliable, secure, and scalable analytics for AI-driven products. To accelerate our Copilot and agent-driven initiatives across Edge you will have to own the technical vision, system design, and implementation strategy for next-generation AI copilots.
The team is supportive of flexible work arrangements and encourages a healthy work-life balance.
If you're ready to take on the challenge of working with highly motivated engineers, with the latest of the Azure offering and taking them to new heights, then we'd love to hear from you! Please apply today and let's build the future together! The team is supportive of flexible work, and you may work from home up to 50% of the time.
Responsibilities
Responsibilities
• Design, build, and operate scalable AI systems that power intelligent product experiences, including Copilot and agent-driven workflows.
• Architect and implement backend services that support multi-step AI interactions, including orchestration pipelines, context management, memory/state persistence, and tool execution.
• Integrate large language models (LLMs), APIs, and internal services to enable context-aware, human-in-the-loop experiences across customer scenarios.
• Build and maintain data and inference pipelines that support model training, fine-tuning, evaluation, and real-time inference across diverse data sources.
• Evaluate, benchmark, and tune AI/ML models (LLMs and traditional models) to meet product requirements for accuracy, latency, reliability, and safety.
• Implement robust retrieval, grounding, and knowledge integration mechanisms (e.g., RAG systems, semantic indexing, vector search) to power intelligent applications.
• Collaborate with product managers, software engineers, and researchers to translate product vision into production-ready AI capabilities and measurable outcomes.
• Ensure reliability, observability, and governance of AI systems, including monitoring model performance, data quality, and responsible AI practices.
• Build reusable platforms, APIs, and tools that enable teams to rapidly develop AI-powered features and self-service intelligent applications.
Qualifications
-
10–12+ years of experience in software engineering, with significant experience building scalable backend or distributed systems. Strong programming expertise in one or more languages such as Python, Go, Java, or C#, with experience designing production-grade services and APIs.
• Experience building AI-powered applications, including integrating LLMs, implementing agent or Copilot workflows, and orchestrating multi-step AI interactions.
• Hands-on experience with LLM application frameworks and orchestration tools such as Semantic Kernel, LangChain, or similar agent frameworks.
• Familiarity with retrieval-augmented generation (RAG) architectures, vector databases, embeddings, and semantic search systems.
• Experience evaluating and improving model performance through prompt design, evaluation frameworks, fine-tuning, or feedback loops.
• Solid understanding of distributed systems concepts including scalability, reliability, observability, caching, and asynchronous processing.
• Experience deploying and operating AI workloads in cloud environments (preferably Azure), including containerized services and GPU-enabled infrastructure.
• Understanding of Responsible AI practices, including model governance, safety, privacy, and evaluation of AI behaviour in production systems.
• Ability to work across product, research, and engineering teams to translate product scenarios into scalable AI system architectures.
• Strong communication skills and ability to explain complex technical concepts to both technical and non-technical stakeholders.
Links:
Azure Stack HCI is a new hyperconverged infrastructure (HCI) operating system delivered as an Azure service that provides the latest security, performance, and feature updates. Deploy and run Windows and Linux virtual machines (VMs) in your data center or at the edge using your existing tools, processes, and skill sets.
This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.