HPC System Administrator m/f/d
IT
Stuttgart, Germany
Why Work at Lenovo
Description and Requirements
Join Lenovo, a global technology leader driving innovation across AI, data center solutions, smart devices, and digital transformation. As the world's #1 provider of supercomputing systems, we help leading organizations solve complex challenges through cutting-edge HPC (High Performance Computing) and AI technologies.
We're looking for an HPC System Administrator to join our Data Center Operations team in Munich, Germany. In this customer-facing role, you'll support and optimize advanced HPC and AI environments, working with Linux systems, servers, storage, networking, and data center infrastructure while ensuring secure, reliable, and high-performing platforms for leading enterprises and research organizations.
What You Will Do:
Act as the single point of contact (SPOC) for customers on technical matters, providing expert support across HPC and AI platforms while building strong long-term customer relationships.
Monitor and maintain critical data center infrastructure, including servers, storage, networking, operating systems, cluster management software, power, cooling, and environmental systems to ensure reliability, availability, and security.
Troubleshoot and resolve hardware, software, operating system, firmware, network, and infrastructure issues, coordinating with vendors and internal teams to minimize downtime and service disruptions.
Manage daily system administration activities, including user access management, software deployment, OS upgrades, firmware updates, security patching, defect resolution, and ongoing platform maintenance.
Take ownership of incident and problem management by responding to system alerts, monitoring performance and system health, identifying root causes, opening vendor support tickets, and tracking issues through resolution.
Support the installation, configuration, testing, upgrade, and optimization of HPC and AI environments, helping customers continuously improve their research computing capabilities and platform performance.
Work closely with customer technology, infrastructure, security, governance, and vendor teams to deliver projects, implement controlled changes, maintain documentation, ensure compliance, and support audits and service reviews.
Provide technical guidance, knowledge transfer, and training to customers and researchers. Contribute to documentation and best practices, participate in projects and customer meetings, and help customers get the most value from their HPC resources through strong customer support and analytical problem solving.
What You Will Need:
Strong Linux administration knowledge, including SUSE, RHEL, and CentOS environments.
Hands-on expertise with Confluent and Slurm in enterprise or HPC environments.
Background in system administration, monitoring, troubleshooting, and technical support of complex infrastructure.
Ability to independently perform operating system installations, software deployments, firmware updates, and system upgrades.
Strong problem-solving skills with the ability to diagnose, investigate, and resolve complex technical issues.
Experience supporting mission-critical systems, ensuring reliability, performance, and operational stability.
Fluent English communication skills, both written and spoken, with the ability to work directly with international customers.
Customer-focused mindset with strong communication and stakeholder management skills, capable of providing high-quality technical support and guidance.
What Lenovo Can Offer You:
- Employee Share Purchase Plan
- Employee Assistance Program, e.g., for health, legal & financial consultancy
- Pension Plan
- Meal Allowance / Lunch Vouchers
- Internal E-learning Development Platform Available for Employees
- Specialized Development Trainings (based on nomination process)
- Employee Groups (LGBT+, WILL, etc.)
- Opportunity to Join/Create Employee Groups (inclusivity, well-being, sports, volunteering, charity, etc.)
- JobRad (Bike Leasing)
- Mobile Phone + 3 SIM Cards for Mobile Working
- Hybrid Working Model *