Director of DevOps / SRE - Equities Trading
JPMorganChase
Software Engineering
Singapore
If you are looking for a game-changing career, working for one of the world's leading financial institutions, you’ve come to the right place.
As an Executive Director in DevOps / Site Reliability Engineering (SRE) at JPMorganChase, within the Corporate & Investment Bank Equities Trading organization, you will set and drive the reliability, automation, and operational resilience strategy for a high-performance, latency-sensitive trading platform operating across multiple global regions. You will lead teams and cross-functional stakeholders to improve production excellence—reducing failure costs, improving availability, and increasing engineering capacity—through standardization, automation, and measurable operational outcomes aligned to a consumer-first mindset.
You will own the governance and delivery framework for significant DevOps/SRE initiatives, ensuring clear outcomes, delivery plans, dependencies, risk management, and lifecycle reporting from initiation through closure. You will also operationalize a shared responsibility model across platform, infrastructure, and application engineering teams to ensure consistent security, compliance, and production standards.
Job Responsibilities
- Define and drive the DevOps/SRE strategy and roadmap for the Electronic Market Making platform, aligned to measurable reliability and productivity outcomes
- Own governance for significant DevOps/SRE initiatives (documented outcomes, delivery plans, milestones, dependencies, and lifecycle reporting through closure)
- Lead cross-functional delivery across application engineering, infrastructure, network, and platform teams, ensuring clear accountability across regions
- Establish production excellence standards and oversee incident management, post-incident reviews, and systemic remediation to reduce failure costs
- Set and continuously improve resilience practices including automated failover, disaster recovery, and production readiness controls
- Own SLO/SLA and observability strategy (monitoring, alerting, and service health metrics/KRIs) to ensure uptime and performance transparency
- Lead CI/CD, release engineering, and multi-region deployment automation (build/test orchestration, artifact management, controlled promotion) to increase delivery confidence and speed
- Drive developer enablement via internal platforms, self-service tooling, and automation that increases engineering capacity while maintaining controls
- Implement and enforce a shared-responsibility operating model across platform/product and LoB engineering teams to meet security, compliance, and operational standards
- Lead and develop DevOps/SRE talent (hiring, coaching, performance) and manage operational and delivery risks with clear escalation and mitigation
Required Qualifications, Capabilities, and Skills
- Bachelor’s degree in Computer Science, Engineering, or equivalent practical experience.
- Extensive experience in DevOps, SRE, Production Engineering, or Platform Engineering, including leadership of teams and/or large cross-functional initiatives with measurable operational outcomes
- Experience supporting electronic trading platforms and/or other low-latency environments (market data, exchange connectivity, order routing)
- Demonstrated accountability for production excellence: incident management oversight, root-cause discipline, reliability improvements, and operational resilience practices
- Strong experience with CI/CD and release engineering at scale (e.g., Jenkins, GitLab CI, or equivalent), including multi-region deployment patterns and governance
- Strong Linux experience (RHEL-based) and system performance diagnostics; ability to set standards for tuning, capacity management, and monitoring.
- Strong automation background (Python and shell), with an emphasis on building reliable, supportable tooling and self-service platforms
- Experience with infrastructure automation and configuration management (e.g., Ansible/Puppet/Salt) and infrastructure-as-code practices.
- Strong networking fundamentals (TCP/IP, DNS, load balancing) and the ability to partner effectively with network teams supporting market data and exchange connectivity.
- Strong communication and stakeholder management skills, including executive-level updates driven by data, risk, and delivery transparency
Preferred Qualifications, Capabilities, and Skills
- Expertise with observability stacks and operational analytics (e.g., Prometheus/Grafana, Splunk, ELK), emphasizing actionable alerting and service health management
- Experience with containerization and orchestration technologies (Docker, Kubernetes) in regulated, production-grade environments
- Familiarity with message-oriented middleware or pub/sub systems (e.g., AMPS, Kafka)
- Understanding of Equities, Options, and Futures market structure
- Experience working in regulated financial environments with strong production controls and audit-ready practices
J.P. Morgan is a global leader in financial services, providing strategic advice and products to the world’s most prominent corporations, governments, wealthy individuals and institutional investors. Our first-class business in a first-class way approach to serving clients drives everything we do. We strive to build trusted, long-term partnerships to help our clients achieve their business objectives.
J.P. Morgan’s Commercial & Investment Bank is a global leader across banking, markets, securities services and payments. Corporations, governments and institutions throughout the world entrust us with their business in more than 100 countries. The Commercial & Investment Bank provides strategic advice, raises capital, manages risk and extends liquidity in markets around the world.
Provide expertise and engineering excellence to enhance, build and deliver market-leading technologies within the firm