hero

Find Your Dream Job Today

Out for Undergrad
companies
Jobs

Software Engineer III-SRE

JPMorganChase

JPMorganChase

Software Engineering
Bengaluru, Karnataka, India
Posted on Feb 23, 2026

There’s nothing more exciting than being at the center of a rapidly growing field in technology and applying your skillsets to drive innovation and modernize the world's most complex and mission-critical systems.

As a Site Reliability Engineer III at JPMorgan Chase within the Employee Platform, you will solve complex and broad business problems with simple and straightforward solutions. Through code and cloud infrastructure, you will configure, maintain, monitor, and optimize applications and their associated infrastructure to independently decompose and iteratively improve on existing solutions. You are a significant contributor to your team by sharing your knowledge of end-to-end operations, availability, reliability, and scalability of your application or platform.

Job responsibilities

  • Collaborate with software engineers and partner teams to design, develop, test, and implement highly available, reliable, and scalable application solutions
  • Implement infrastructure, configuration, and network as code for applications and platforms
  • Design and implement deployment approaches using automated CI/CD pipelines
  • Resolve complex technical problems by engaging technical experts, stakeholders, and team members
  • Define, measure, and review service level indicators and use service level objectives to proactively prevent customer impact
  • Support and advocate for adoption of site reliability engineering best practices within the team
  • Automate operational processes, runbooks, and recovery procedures to reduce toil and improve reliability
  • Establish observability across services, including metrics, logs, and traces, to enable rapid detection and diagnosis
  • Perform incident response, post‑incident reviews, and root‑cause analysis to drive corrective actions
  • Conduct capacity planning, performance tuning, and resilience testing to meet growth and reliability goals
  • Document architectures, operational procedures, and deployment configurations to ensure repeatability and compliance

Required qualifications, capabilities, and skills

  • Formal training or certification on software engineering concepts and 3+ years applied experience

  • Hands on experience, as a software engineer and/or site reliability engineer
  • Program proficiently in Python and/or Java for large‑scale data handling and migration
  • Operate platforms and applications on public, private, or hybrid cloud infrastructures
  • Hold formal training or certification in site reliability engineering (SRE) and apply at least 3 years of SRE experience
  • Implement observability using white‑ and black‑box monitoring, SLO‑based alerting, and telemetry with Grafana, Dynatrace, Prometheus, Datadog, and Splunk
  • Apply SRE culture and principles in real‑world applications or platforms
  • Apply knowledge of software applications and technical processes across Cloud, Artificial Intelligence, and Machine Learning
  • Build and maintain CI/CD pipelines with tools such as Jenkins, GitLab, and Terraform
  • Troubleshoot networking technologies and issues; collaborate effectively in large teams, communicate clearly, remove roadblocks proactively, and innovate while staying current with emerging technologies

    Preferred qualifications, capabilities, and skills

  • Solve complex, mission‑critical problems across one or more technology domains
  • Develop automated tools, systems, and services spanning multiple technology domains
  • Apply working knowledge of infrastructure components such as routers, load balancers, cloud products, containers, compute, storage, and networks
  • Debug and troubleshoot systems; implement service‑level changes and manage operations with monitoring and log analysis tools

Build a Resilient System for a Communication and collaboration space which includes complex & mission critical systems.