Principal Engineer - Sr. Site Reliability Engineer

Wells Fargo

Software Engineering
Multiple locations
Posted on Tuesday, May 21, 2024

About this role:

We are looking for a Sr. Site Reliability Engineer who thrives on solving complex problems through innovation and on driving change at scale in a diverse environment. You will join a focused team of Application Support and SREs introducing and advancing the SRE discipline across several hundred applications and multiple vertical lines of business supporting the entire firm. The team will drive technology transformation and adoption of SRE-aligned enterprise capabilities and products, launch new tooling enablement, automate away complex issues, and integrate with the latest technology. Site Reliability Engineers leverage their experience as software and systems engineers to ensure that applications onboarded to SRE are available, have full-stack observability, improve continuously through code and automation, provide operational insight through analytics, are continuously tested, and are integrated with CI/CD. They also work with application teams to ensure the products and services we provide are always on.

In this role, you will:

  • Instantiate Site Reliability Engineering and AIOps capabilities at Wells Fargo Enterprise Functions Technology (EFT), igniting the practice, principles, and culture and leading by example. Assist in training skilled peer engineers by growing the practice within EFT and partnering with peer platform-embedded SRE teams.
  • Introduce and mature the adoption of enterprise capabilities, tools, and innovation improving availability in a multi-cloud ecosystem by evolving observability, monitoring, logging, synthetic monitoring and chaos engineering.
  • Evolve AIOps, introducing self-healing and autonomic capabilities that solve complex operational and systemic issues with precision, including automating processes and leveraging Robotic Process Automation and AI/ML to improve the availability of the products we provide to customers.
  • Automate key SRE metrics and IT Service Operations processes, including customer impact, availability of critical business flows, SLO/SLI adherence, and error budget, and reduce time to recovery.
  • Share support responsibilities for critical applications and customer journeys, including leading technical resolution of high-priority incidents with cross-functional partners, remediating issues, conducting blameless postmortems and root cause analysis, and introducing continuous improvement that solves problems once and for all, with the goal of no repeats.
  • Closely collaborate with EFT application development teams and other peer organizations to influence and drive stability and SRE-aligned capability.
  • Act as an advisor to leadership to develop or influence applications, network, information security, database, operating systems, or web technologies for highly complex business and technical needs across multiple groups.
  • Lead the strategy and resolution of highly complex and unique challenges requiring in-depth evaluation across multiple areas or the enterprise, delivering solutions that are long-term, large-scale and require vision, creativity, innovation, advanced analytical and inductive thinking.

Required Qualifications:

  • 10+ years of Engineering experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education
  • 7+ years of Java, C#, Python or other object-oriented software engineering experience
  • 5+ years of experience performing engineering and support tasks on Linux/Unix and Windows Servers
  • 3+ years of experience with Cloud technologies
  • 3+ years of experience supporting enterprise-level complex applications and platforms in Production
  • 5+ years of designing and building complex observability solutions leveraging industry-standard toolsets and/or custom-built solutions
  • 5+ years working with configuration and monitoring technologies such as Ansible, Grafana, Elastic, Splunk, Prometheus
  • Strong verbal, written, and interpersonal communication skills

Desired Qualifications:

  • A Master's degree or higher in computer science or engineering
  • Experience with the design, implementation, and governance of Artificial Intelligence, Natural Language Processing, or Machine Learning architecture
  • Experience with Agile Scrum (Daily Standup, Sprint Planning and Sprint Retrospective meetings) and Kanban

Job Expectations:

  • Ability to travel up to 10%
  • In-office expectation of three days at one of the listed locations

Pay Range

$144,400.00 - $300,000.00


Wells Fargo provides all eligible full- and part-time employees with a comprehensive set of benefits designed to protect their physical and financial health and to help them make the most of their financial future. Visit Benefits - Wells Fargo Careers for an overview of the following benefit plans and programs offered to employees.

  • 401(k) Plan
  • Paid Time Off
  • Parental Leave
  • Critical Caregiving Leave
  • Discounts and Savings
  • Health Benefits
  • Commuter Benefits
  • Tuition Reimbursement
  • Scholarships for dependent children
  • Adoption Reimbursement

Posting End Date:

15 Jun 2024

*Job posting may come down early due to volume of applicants.

We Value Diversity

At Wells Fargo, we believe in diversity, equity and inclusion in the workplace; accordingly, we welcome applications for employment from all qualified candidates, regardless of race, color, gender, national origin, religion, age, sexual orientation, gender identity, gender expression, genetic information, individuals with disabilities, pregnancy, marital status, status as a protected veteran or any other status protected by applicable law.

Employees support our focus on building strong customer relationships balanced with a strong risk mitigating and compliance-driven culture which firmly establishes those disciplines as critical to the success of our customers and company. They are accountable for execution of all applicable risk programs (Credit, Market, Financial Crimes, Operational, Regulatory Compliance), which includes effectively following and adhering to applicable Wells Fargo policies and procedures, appropriately fulfilling risk and compliance obligations, timely and effective escalation and remediation of issues, and making sound risk decisions. There is emphasis on proactive monitoring, governance, risk identification and escalation, as well as making sound risk decisions commensurate with the business unit’s risk appetite and all risk and compliance program requirements.

Candidates applying to job openings posted in US: All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other legally protected characteristic.

Applicants with Disabilities

To request a medical accommodation during the application or interview process, visit Disability Inclusion at Wells Fargo.

Drug and Alcohol Policy

Wells Fargo maintains a drug-free workplace. Please see our Drug and Alcohol Policy to learn more.