Principal Software Engineer - Observability
Microsoft
Principal Software Engineer - Observability
Dublin, Ireland
Save
Overview
Are you passionate about client and service telemetry and observability for global services? Office 365 is the locomotive that is driving the growing Microsoft valuation, and critical to the future of Microsoft. OneDrive and SharePoint (ODSP) are the set of intelligent, high value services and compliant environment that is enabling the next generation of transformative end-user experiences for Office and the entire company. The ODSP team has an opportunity for you get in on designing and building a core part of the observability ecosystem that is critical to how we deliver world-class reliability for our customers, safely and in compliance with global regulations and policies.
This Principal Software Engineer position is for an Individual Contributor (IC) on the ODSP Observability Engineering team to develop new service features for the management and usage of telemetry generated by our services and clients.
The service is globally distributed, highly available and resilient, and has very high demands for a robust observability ecosystem that is compliant with geo-regional policies and regulations.
Your responsibility will be to analyze, design, and implement improvements to how the service generates and processes telemetry, how those telemetry are stored, managed and accessed and how such access policies adhere to various geo-regional policies and regulations while still being easy to use by our engineering teams. Key goals are to improve performance and security, reduce Cost of Goods Sold (COGS), and drive those changes across multiple products in both ODSP and Microsoft.
The ideal candidate should have strong analytical, design, and development skills with depth in telemetry ecosystems, databases, storage, high performance data structures, and algorithms and a passion for observability.
She/He should have a strong development background, excellent communication skills, and a strong foundation in Computer Science. Lastly, because the work is done indirectly on behalf of a large team, influencing without authority is key to success.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Qualifications
- Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
- Collaborate with partner teams to meet the engineering goals in a unified manner.
- Proficiency in C# or C/C++, and strong design, implementation, and debugging skills; knowledge of scripting languages a plus.
- Deep experience with telemetry and observability standards such as OpenTelemetry, Prometheus, Grafana, Parquet, Loki, or the Microsoft Observability platform.
- Experience with distributed systems, performance analysis, databases, and/or large-scale data processing.
- Strong communication skills (both written and oral).
- Ability to prioritize tasks and work independently.
Other Requirements:
Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings:
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.
Preferred Qualifications:
- Knowledge of Azure services and observability systems a plus.
- Experience with data science, analytics and ontology a plus.
- Experience with building cloud-scale infrastructure components.
- Awareness, passion, and experience related to cloud scale distributed design and patterns.
- Familiar with secure software design concepts.
- Proven track record of delivering projects that include multiple components.
#ODSPEng
Responsibilities
- Partners with appropriate stakeholders to determine user requirements for a set of scenarios.
- Leads identification of dependencies and the development of design documents for a product, application, service, or platform.
- Leads by example and mentors others to produce extensible and maintainable code used across products.
- Leverages subject-matter expertise of cross-product features with appropriate stakeholders (e.g., project managers) to drive multiple group's project plans, release plans, and work items.
- Holds accountability as a Designated Responsible Individual (DRI), mentoring engineers across products/solutions, working on-call to monitor system/product/service for degradation, downtime, or interruptions.
- Proactively seeks new knowledge and adapts to new trends, technical solutions, and patterns that will improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale and shares knowledge with other engineers.