Jr. AI Reliability Operations Engineer
Lenovo
Software Engineering, Operations, Data Science
Chicago, IL, USA
USD 80k-90k / year
Why Work at Lenovo
Description and Requirements
About Qira
Qira is Lenovo’s cross‑device Personal AI that works across phones, PCs, and other Lenovo and Motorola products. It combines on‑device intelligence and cloud intelligence to provide a consistent and helpful AI experience. Qira understands context across devices, supports voice and text interactions, and helps users complete tasks in a smooth and reliable way. The Qira engineering team builds and operates the systems that power this experience, including the cloud services, device integrations, data flows, and AI components behind it.
About Our Team
We are looking for an AI Reliability Operations Engineer to support the operational health of Qira's production and non-production systems. This role spans system monitoring, alert response, incident triage, and observability across the full AI stack, including model performance, inference pipelines, and cloud services. You will also have visibility into SDLC operations across staging and pre-production environments, helping ensure that releases and configuration changes land cleanly. This is a foundational role in keeping Qira stable and available for users around the world.
Location: Onsite in Chicago, IL; Hybrid (3 days onsite, 2 days remote)
What You'll Do
- Monitor Qira's production and non-production systems using observability dashboards, alerting tools, and AI-specific signals including model performance, inference latency, and data pipeline health.
- Perform initial triage on active incidents and alerts, following runbooks to assess impact, gather relevant data, and escalate accurately to the appropriate engineering teams.
- Observe and report on SDLC operations across staging and pre-production environments, flagging anomalies and supporting engineering teams during releases and configuration changes.
- Watch proactively for early warning signals across Qira's cloud services, device integrations, and AI components, not just respond to active alerts.
- Verify system health before and after deployments and configuration changes, and assist engineering with deployment checks.
- Track and maintain incident progress in ticketing systems and ensure clear, accurate records throughout every issue.
- Contribute to post-incident reports.
- Share timely status updates with the operations team and shift lead during active incidents.
- Review alert thresholds, update runbooks, and flag procedural gaps to the shift lead or SRE.
Basic Qualifications
- Bachelor's Degree required.
- Experience with cloud concepts, distributed applications, and system monitoring.
- Experience with Linux commands and simple networking concepts.
- Experience with observability tools such as Grafana, Datadog, or cloud-native dashboards.
Preferred Qualifications
- Experience in a technical operations, SRE, or production support environment.
- Familiarity with alerting tools such as PagerDuty or OpsGenie and ticketing systems such as Jira or ServiceNow.
- Clear written and verbal communication in English, including the ability to write accurate incident updates under pressure.
- Comfortable following written procedures and runbooks precisely in a fast-paced operational environment.
- Exposure to AI or ML systems, including awareness of how model quality and data pipelines are monitored.
- Basic scripting ability or comfort reading and adapting existing scripts and runbook commands.
- Experience working across time zones in a globally distributed team.
- Ability to work assigned shifts including nights, weekends, and holidays.
What Success Looks Like
A successful AI Reliability Operations Engineer detects issues early, responds to alerts quickly, performs accurate initial triage, and keeps clear and complete records across production and non-production environments. Their work directly supports system uptime and ensures that Qira delivers a reliable, consistent experience for users at all times.
The base salary budgeted range for this position is $80K - $90K. Individuals may also be considered for bonus and/or commission.
Lenovo’s various benefits can be found on www.lenovobenefits.com.