Senior Software Engineer/SRE - TRAX Observability

Bloomberg

Bloomberg

Software Engineering

London, UK

Posted on Apr 20, 2026
About TRAX
TRade Automation and eXecution (TRAX) is part of Bloomberg Enterprise Products Engineering. We build trade automation solutions and multiple Execution Management Systems (EMSs) that enable clients to route orders, execute trades, and monitor outcomes across asset classes. Trading is the core action of financial markets. Once investment decisions are made, traders rely on our systems to execute and manage trades. Ensuring these systems are observable, scalable, resilient, and well-managed from a technical risk perspective is critical — and that’s where TRAX Observability comes in.
TRAX Observability provides the data infrastructure, dashboards, and insights needed to understand system behavior and client experience across our EMS platforms. We equip Teams, Managers, Senior Management, and Product with the tools and analysis required to reason about performance and reliability. We partner closely with TRAX engineering teams and our Scalability and Resilience counterparts in New York.
Our work focuses on:
  • Informing (or alerting) stakeholders to system performance and degradation
  • Demonstrating client impact during deployments
  • Identifying emerging client behaviors and future system needs
We build and maintain data infrastructure using firm-supported monitoring tools. This includes a custom telemetry platform that combines multiple data sources for advanced analysis, and a distributed trace pipeline (Argo, Spark, Solr) that processes large-scale data for deep investigation. We also leverage tools such as Humio, Grafana, and MetricTank to support observability across the department.
What’s in it for you?
Learning & Technical Growth
  • Work alongside experienced senior engineers with deep expertise in distributed systems, trading platforms, cloud infrastructure, and operations. You’ll gain hands-on experience building high-throughput metrics and observability systems.
Influence & Visibility
  • Observability is central to system reliability and client experience. Your work will directly impact the stability of key Bloomberg systems and help prevent client-facing issues.
Network & Stakeholder Exposure
  • Collaborate with engineering and product teams across London, Frankfurt, Tel Aviv, and New York, as well as peer SRE teams focused on Scalability and Resilience. You’ll develop strong stakeholder management and communication skills.
We’ll trust you to:
  • Enhance and maintain systems that capture and present performance metrics
  • Improve the reliability and accuracy of telemetry and analysis
  • Understand and assess client experience risks within EMS platforms
  • Communicate system health and performance to stakeholders
  • Partner across teams to strengthen observability
  • Support Scalability and Resilience initiatives with actionable data
  • Assist in triaging major incidents and production issues
You will need to have:
  • Experience with a high-level language (Python preferred, but not required; Java, C++, etc. welcome)
  • Knowledge of Unix/Linux fundamentals (or strong willingness to learn)
  • Familiarity with observability concepts (e.g., distributed tracing, logging, metrics, tools such as Grafana or similar)
  • Understanding of distributed systems concepts (replication, partitioning, scalability, messaging, state management) and eagerness to deepen that knowledge
We would love to see:
  • Experience with C, C++, or Java
  • Exposure to cloud and data-processing technologies (e.g., Argo, Spark, Solr)
  • Experience communicating across IC and leadership levels
  • Curiosity across the full software/hardware stack
  • Strong written and verbal technical communication skills