Data Pipeline Engineer
Lenovo
This job is no longer accepting applications
See open jobs at Lenovo.See open jobs similar to "Data Pipeline Engineer" Out for Undergrad.Why Work at Lenovo
We are Lenovo. We do what we say. We own what we do. We WOW our customers.
Lenovo is a US$62 billion revenue global technology powerhouse, ranked #217 in the Fortune Global 500, employing 77,000 people around the world, and serving millions of customers every day in 180 markets. Focused on a bold vision to deliver smarter technology for all, Lenovo has built on its success as the world’s largest PC company by further expanding into growth areas that fuel the advancement of ‘New IT’ technologies (client, edge, cloud, network, and intelligence) including server, storage, mobile, software, solutions, and services.
This transformation together with Lenovo’s world-changing innovation is building a more inclusive, trustworthy, and smarter future for everyone, everywhere. To find out more visit www.lenovo.com, and read about the latest news via our StoryHub.
Description and Requirements
At Lenovo, we Never Stand Still. Every day, every employee at Lenovo is focused on moving forward, rejecting traditional limits, and always seeking a better way.
Do you have a passion for data and building pipelines that power cutting edge AI? Are you a results-oriented engineer who thrives in a collaborative environment? If so, we want to hear from you! We’re looking for a Data Engineer to work with the AI platform team within our Cloud and Software (CSW) Group at Lenovo.
About the role
In this role you will work directly with locally & globally distributed teams responsible for defining, designing and building robust data pipelines and data serving platforms in a cloud-based SaaS environment. You’ll be part of a highly dynamic software development team working on initiatives with a goal to transform and enhance the overall business value of our products and platforms. You will be responsible for using distributed data processing frameworks to ingest, extract, transform, store, serve and build data sets that could be used by AI applications, Data Scientists and ML engineers. You will apply your knowledge of algorithms, pipelines, cloud expertise, AI & ML, data processing, supporting tools and technologies to develop new data pipeline solutions, manage and maintain existing pipelines. As a Data Engineer on the team, you will play a key role in improving existing data models, pipelines and maintaining them for a worldwide customer base.
This is a great opportunity if you are:
- passionate about data
- have a strong sense of responsibility and ownership
- resourceful in face of ambiguity and thrive on change
- an independent thinker who can solve complex problems
- an excellent collaborator and with solid communication skills, demonstrated by successful cross team collaboration
Responsibilities:
- In this role, you will work on the various data ingestion and storage platform that is an integral part of the AI ecosystem
- The data you shape will be used to power AI capabilities of the core platform that enables various cloud solutions of the company.
- As a Data Engineer, you will be responsible for developing new data pipelines for data ingestion and transformation, building/updating capabilities of the existing data pipelines including real-time streaming and batch processing.
- Testing the data pipelines for Quality, Data integrity and Validity is an integral part of this role.
- You will take end to end ownership of implementing solutions to the identified issues with the focus on quality, stability, security and customer satisfaction.
- You'll collaborate with a multidisciplinary, globally distributed team of professionals that can include Data Scientists, Machine Learning Engineers, Business Analysts, Project and Product Management
- Designing, building, implementing, and documenting data models
- Working with business partners to understand business and product objectives, identify the data needed to support them, while influencing the decisions
- Optimizing data transformation pipelines to improve latency or reduce computational time, cost.
Minimum Qualifications:
- Bachelor's degree in computer science, Information Systems, Engineering, Math or related technical field.
- 1+ year of experience developing and maintaining data processing pipelines using Spark, Hadoop, Hive.
Preferred Qualifications:
- Masters degree is a plus
- Experience with programming languages such as Groovy and Python (preferred)
- Professional experience in Data Engineering, and/or building scalable streaming and/or batch data pipelines
- Experience in Data Engineering tooling: collection, cleaning, transformation, ingestion, storage, publishing
- Advanced SQL skills (such as window functions, defining UDFs)
- Experience working with relational as well as NoSQL databases and streaming platforms such as Kafka
- Knowledge of Cloud technologies and concepts is preferred, especially Athena, QuickSight & Q.
- Familiarity with version control systems, CI/CD practices, testing
- Experience with data discovery, lineage, data governance, data orchestration, data quality metrics measurement is plus
- Experience working with machine learning engineers, data scientists, and ML applications is plus
- Familiarity with Angular and other UI frameworks is a plus
This job is no longer accepting applications
See open jobs at Lenovo.See open jobs similar to "Data Pipeline Engineer" Out for Undergrad.