AI Platform & Infra Architect
Cummins Turbo Technologies
工作总结:
- 负责评估、定义并倡导根据康明斯 IT 战略保持一致的 IT 体系结构策略、原则、政策和标准,并规范其用途;协调项目团队,创建、记录、评估和/或验证特定系统的应用程序和基础设施解决方案设计,以满足康明斯的安全政策,规范,合规性和法规,并与利益相关者一起审查体系结构工件。
主要责任:
有效地运用风格、堆栈和模式等企业架构标准。
开发项目解决方案设计,满足业务和技术要求。
定义符合 IT 技术标准的参考架构,作为其他解决方案的构建块反复使用。
作为变革推动者参与其中,促进战略性 IT 架构方向 - Cloud first、API first、DevOps 和 Agile 等。
领导和促进技术分析、解决方案设计和技术领域会议,推动最佳 IT 解决方案的产生。
与客户和 IT 同事建立良好的关系,使用标准技术有效地促进解决方案的开发。
维持对新兴技术、软件趋势和工具的意识。
考虑多个不同的方案,根据成本/利润分析提出恰当的解决方案。
促进和支持试点和/或概念验证活动,以验证技术能力。
与项目团队合作,了解所需的能力并建议能够满足需求的合适技术组合。
针对新的应用程序和系统,提供基础设施能力建议。
Qualifications
AI Platform / Infra Architect (Senior Level)
Cummins AI Lab – Artificial Intelligence Laboratory
About Cummins AI Lab
The AI Lab develops robust, scalable infrastructure powering LLMs, agent frameworks, retrieval systems, knowledge platforms, and enterprise AI applications across Cummins’ global footprint.
We design the backbone of Cummins’ AI ecosystem.
We are seeking a Senior AI Platform / Infra Architect to lead compute, cloud, deployment, and infrastructure architecture for enterprise-scale LLM and AI systems.
Role Overview
This senior technical role requires expertise across cloud architecture, distributed systems, LLM deployment, GPU infrastructure, networking, storage, and end-to-end engineering .
You will architect the platforms that power Cummins’ AI solutions globally.
Key Responsibilities
1. AI Infrastructure & Deployment Architecture
Design scalable LLM inference and service architectures (vLLM, Triton, DeepSpeed-Inference).
Architect hybrid cloud/on-prem compute for model serving, retrieval, and agent workflows.
Ensure high availability, security, performance, observability, and compliance.
2. End-to-End Platform Engineering
Build CI/CD, LLMOps, automation, and deployment pipelines.
Optimize compute, networking (RoCE/IB), caching, and I/O across AI workloads.
Implement monitoring, logging, tracing, and cost optimization.
3. Foundational Platform Capability Development
Build platform components including data ingestion layers, vector databases, model serving gateways, and knowledge systems.
Enable product teams and AI teams to develop scalable solutions on top of the infra.
4. Technical Leadership & Collaboration
Partner with AI scientists, Solution Architects, and Product teams to ensure platform alignment.
Lead technical design reviews and drive architectural standards.
Must-Have Qualifications
Master’s degree in Computer Science, Distributed Systems, Cloud Computing, or related fields; OR equivalent industry experience.
Strong hands-on experience with cloud-native systems: Kubernetes, Docker, microservices, service mesh.
Deep expertise in GPU/TPU accelerators, distributed inference/training, high-performance networking.
Proven experience deploying AI systems into production environments.
Fluency across IaaS/PaaS/SaaS layers and hybrid cloud infrastructure.
Strong backend and systems engineering ability (APIs, infra automation, DevOps/LLMOps).
Experience in enterprise security and compliance.
Preferred Qualifications
Experience in major cloud providers or AI infrastructure teams.
Familiarity with industrial IT/OT integration.
Experience architecting large-scale, high-performance inference clusters.
What You Will Gain
Ownership of core AI infrastructure for a global enterprise.
Opportunity to architect high-impact, scalable AI systems that support key engineering and industrial workflows.
Responsibilities
技能:
协作 - 建立合作伙伴关系并与他人协作,以达成共同目标。
有效沟通 - 发展和实现多模式沟通,清晰了解不同受众的特定需求。
决策质量 - 及时作出高质量的决策,推动组织发展。
建立信任 - 做到诚实、正直和真实,赢得他人的信任和信赖。
优化工作流程 - 了解最有效和高效的流程,并不断改善,以完成工作。
建筑系统设计 - 通过使用架构开发框架和方法来应用架构原理,定义和开发架构可交付成果、工件和构件(例如概念模型、逻辑模型、物理网络设计等),使组织能够以受控方式转换其系统以满足业务需求。
解决功能拟合分析 - 使用程序、工具和工作助手来组合系统并将系统分解为组成部分,研究组合部件的设计、购买和配置能够在多大程度上实现完整交互,以满足业务、技术、安全、治理和合规要求。
系统解决方案架构 - 使用康明斯技术参考模型 (CTRM),CLEAN 标准和现有参考模式,创建解决方案设计和模式,确保与康明斯标准保持一致。
看重差异性 - 认识到不同视角和文化给组织带来的价值。
业务定义 - 使用业务分析工具包(对五个方面进行建模并创建用例)来定义方案将提供的业务成果,以证明资源投资(人员、时间、财务)的合理性。
教育,资格,认证:
- 要求具有计算机科学、信息技术、商科或相关专业的大专、本科或同等学历,或相关的同等工作经验。该职位可能需要获得有关遵守出口管制或制裁法规的许可证。
经历:
- 要求具备中级水平的相关工作经验。拥有 3-6 年的经验。
100% On-Site No
As Cummins continues to grow, you'll be provided with continuous learning opportunities, supportive benefits and a culture that values your wellbeing, safety and work-life balance. Here, you'll have the power to determine your future with innovative technology, a focus on sustainability and with a company positioned for long-term growth.