Position Title | Lead D&T System Reliability Engineer | Function/Group | Digital and Technology |
Location | Mumbai | Shift Timing | Regular |
Role Reports to | Manager –SRE | Remote/Hybrid/in-Office | Hybrid |
ABOUT GENERAL MILLS
We make food the world loves: 100 brands. In 100 countries. Across six continents. With iconic brands like Cheerios, Pillsbury, Betty Crocker, Nature Valley, and Häagen-Dazs, we’ve been serving up food the world loves for 155 years (and counting). Each of our brands has a unique story to tell.
How we make our food is as important as the food we make. Our values are baked into our legacy and continue to accelerate
us into the future as an innovative force for good. General Mills was founded in 1866 when Cadwallader Washburn boldly bought the largest flour mill west of the Mississippi. That pioneering spirit lives on today through our leadership team who upholds a vision of relentless innovation while being a force for good. For more details check out http://www.generalmills.com
General Mills India Center (GIC) is our global capability center in Mumbai that works as an extension of our global organization delivering business value, service excellence and growth, while standing for good for our planet and people.
With our team of 1800+ professionals, we deliver superior value across the areas of Supply chain (SC) , Digital & Technology (D&T) Innovation, Technology & Quality (ITQ), Consumer and Market Intelligence (CMI), Sales Strategy & Intelligence (SSI) , Global Shared Services (GSS) , Finance Shared Services (FSS) and Human Resources Shared Services (HRSS).For more details check out https://www.generalmills.co.in
We advocate for advancing equity and inclusion to create more equitable workplaces and a better tomorrow.
JOB OVERVIEW
Function Overview
The Digital and Technology team at General Mills stands as the largest and foremost unit, dedicated to exploring the latest trends and innovations in technology while leading the adoption of cutting-edge technologies across the organization. Collaborating closely with global business teams, the focus is on understanding business models and identifying opportunities to leverage technology for increased efficiency and disruption. The team's expertise spans a wide range of areas, including AI/ML, Data Science, IoT, NLP, Cloud, Infrastructure, RPA and Automation, Digital Transformation, Cyber Security, Blockchain, SAP S4 HANA and Enterprise Architecture. The MillsWorks initiative embodies an agile@scale delivery model, where business and technology teams operate cohesively in pods with a unified mission to deliver value for the company. Employees working on significant technology projects are recognized as Digital Transformation change agents.
The team places a strong emphasis on service partnerships and employee engagement with a commitment to advancing equity and supporting communities. In fostering an inclusive culture, the team values individuals passionate about learning and growing with technology, exemplified by the "Work with Heart" philosophy, emphasizing results over facetime. Those intrigued by the prospect of contributing to the digital transformation journey of a Fortune 500 company are encouraged to explore more details about the function through the provided Link
Purpose of the role
The System Reliability Engineer Lead (SRE Lead) will act as a technical lead of SRE’s to proactively to build best possible cloud solutions, ensure the stability, resilience, and scale of our services by automation, testing and engineering. The SRE technical lead would be responsible for driving the overall reliability, efficiency, and scalability of an organization's IT infrastructure, while also providing technical directions and mentorship to the SRE team.
The person would collaborate with other Infrastructure technical leads, product owners, and data/application architects to ensure that new features are reliable and supportable. Make decisions on the adoption of new technologies, tools, and practices that enhance cloud solutions and system reliability. The person should have the Ability to Influence and Drive Change within the organization, promoting SRE best practices and culture.
KEY ACCOUNTABILITIES
- Provide technical directions and guidance across the SRE team, acting as a subject matter expert and leading best practice techniques in implementing SRE practices.
- To mentor the SRE team in ensuring technical assurance in significant projects, for the delivery of quality technical deliverables, which may involve several teams or technologies.
- Provide technical coaching and mentoring to the SRE team to improve their skillset, increase knowledge and set the benchmark of quality and precision engineering.
- Implement a strategic framework of Monitoring and Observability scaling across multiple stakeholders.
- Collaborate with application architects to build new infrastructure in cloud and ensure the stability and scalability of our internal systems.
- Implement new technologies to build future of Application Hosting capabilities and how applications are built and delivered.
- Drive continuous improvement by adopting the right vendor technologies to support applications, in areas such as monitoring, operational task automation, continuous integration, deployments and performance tuning .
- Investigate and resolve complex and multi-faceted issues, spanning the entire technology stack, which require working across teams and technology boundaries.
- Proactively improve site reliability and key metrics, such as up-time, application performance, time to issue resolution, time spent resolving incidents and other key operational SLAs
MINIMUM QUALIFICATIONS
- Total 12+ years of experience in designing and implementing Applications.
- 5+ years of experience in GCP and Infrastructure as a code tool like Terraform, Ansible, Azure Resource Manager, designing cloud solutions etc.
- 3+ years’ Experience in Leading Teams or projects, preferably in a technical lead
- Deep Understanding of Monitoring and Observability Tools like Cloud Monitoring , Datadog, Grafana , or Splunk.
- Strong Knowledge of Infrastructure as Code (IaC) tools like Terraform, CloudFormation, or Ansible.
- Proven Track Record of improving system reliability, performance, and scalability in complex, distributed systems.
- Strong knowledge on Linux and/or Windows Administration and troubleshooting.
- Experience in developing CI/CD pipelines using technologies like, Jenkins, Artifactory, Vault, GitHub Actions.
- Containerization, container orchestration, deployment/monitoring.
- Familiarity with container ecosystem/technologies, such as, Kubernetes, and the deployment/monitoring of those systems, especially Jenkins, Artifactory, Vault etc.
- Expertise in deployment, building, scanning, and monitoring of the applications in Cloud (GCP)
- Experience interacting with Infrastructure APIs like DNS, F5, etc
- Experience working in Agile teams, defining SLO’s and SLI’s for products.
- Leadership Abilities with experience mentoring and guiding team members.
- Strong Problem-Solving Skills and the ability to make decisions under pressure.
- Excellent Communication Skills to effectively collaborate with cross-functional teams.
PREFERRED QUALIFICATIONS
- Master’s Degree in Computer Science, Engineering, Information Technology, or a related field.
- GCP Professional Cloud Architect certification or similar certifications in cloud and DevOps practices.
- Has significant experience in DevOps implementation and in evolving practices and ways of working through multi-disciplinary teams, business frameworks and culture.
- Experience deploying and running applications on Linux in containers.
- Knowledge of Debugging tools like WinDBG, Debug Diag etc
- Azure and/or AWS Cloud experience.
- Knowledge of Server configuration and hardening
- Experience supporting MS IIS in an enterprise environment
- Understanding of networking protocols and topography
- Strong understanding of firewalls, DNS, TCP/IP, HTTP
- Experience with traffic load balancers such as F5’s, GCP Load balancers etc
- Development experience on the Microsoft stack
- Experience with Kafka and Elastic.