General Information

Req #

WD00071207

Career area:

Data Management and Analytics

Country/Region:

United States of America

State:

North Carolina

City:

Morrisville

Date:

Tuesday, September 3, 2024

Working time:

Full-time

Additional Locations:

* United States of America - North Carolina - Morrisville

Why Work at Lenovo

We are Lenovo. We do what we say. We own what we do. We WOW our customers.

Lenovo is a US$62 billion revenue global technology powerhouse, ranked #217 in the Fortune Global 500, employing 77,000 people around the world, and serving millions of customers every day in 180 markets. Focused on a bold vision to deliver smarter technology for all, Lenovo has built on its success as the world’s largest PC company by further expanding into growth areas that fuel the advancement of ‘New IT’ technologies (client, edge, cloud, network, and intelligence) including server, storage, mobile, software, solutions, and services.

This transformation together with Lenovo’s world-changing innovation is building a more inclusive, trustworthy, and smarter future for everyone, everywhere. To find out more visit www.lenovo.com, and read about the latest news via our StoryHub.

Description and Requirements

At Lenovo, we Never Stand Still. Every day, every employee at Lenovo is focused on moving forward, rejecting traditional limits, and always seeking a better way.

Do you have a passion for data and building pipelines that power cutting edge AI? Are you a results-oriented engineer who thrives in a collaborative environment? If so, we want to hear from you! We’re looking for a Data Engineer to work with the AI platform team within our Cloud and Software (CSW) Group at Lenovo.

About the role

In this role you will work directly with locally & globally distributed teams responsible for defining, designing and building robust data pipelines and data serving platforms in a cloud-based SaaS environment. You’ll be part of a highly dynamic software development team working on initiatives with a goal to transform and enhance the overall business value of our products and platforms. You will be responsible for using distributed data processing frameworks to ingest, extract, transform, store, serve and build data sets that could be used by AI applications, Data Scientists and ML engineers. You will apply your knowledge of algorithms, pipelines, cloud expertise, AI & ML, data processing, supporting tools and technologies to develop new data pipeline solutions, manage and maintain existing pipelines. As a Data Engineer on the team, you will play a key role in improving existing data models, pipelines and maintaining them for a worldwide customer base.

This is a great opportunity if you are:

passionate about data
have a strong sense of responsibility and ownership
resourceful in face of ambiguity and thrive on change
an independent thinker who can solve complex problems
an excellent collaborator and with solid communication skills, demonstrated by successful cross team collaboration

Responsibilities:

In this role, you will work on the various data ingestion and storage platform that is an integral part of the AI ecosystem
The data you shape will be used to power AI capabilities of the core platform that enables various cloud solutions of the company.
As a Data Engineer, you will be responsible for developing new data pipelines for data ingestion and transformation, building/updating capabilities of the existing data pipelines including real-time streaming and batch processing.
Testing the data pipelines for Quality, Data integrity and Validity is an integral part of this role.
You will take end to end ownership of implementing solutions to the identified issues with the focus on quality, stability, security and customer satisfaction.
You'll collaborate with a multidisciplinary, globally distributed team of professionals that can include Data Scientists, Machine Learning Engineers, Business Analysts, Project and Product Management
Designing, building, implementing, and documenting data models
Working with business partners to understand business and product objectives, identify the data needed to support them, while influencing the decisions
Optimizing data transformation pipelines to improve latency or reduce computational time, cost.

Minimum Qualifications:

Bachelor's degree in computer science, Information Systems, Engineering, Math or related technical field.
1+ year of experience developing and maintaining data processing pipelines using Spark, Hadoop, Hive.

Preferred Qualifications:

Masters degree is a plus
Experience with programming languages such as Groovy and Python (preferred)
Professional experience in Data Engineering, and/or building scalable streaming and/or batch data pipelines
Experience in Data Engineering tooling: collection, cleaning, transformation, ingestion, storage, publishing
Advanced SQL skills (such as window functions, defining UDFs)
Experience working with relational as well as NoSQL databases and streaming platforms such as Kafka
Knowledge of Cloud technologies and concepts is preferred, especially Athena, QuickSight & Q.
Familiarity with version control systems, CI/CD practices, testing
Experience with data discovery, lineage, data governance, data orchestration, data quality metrics measurement is plus
Experience working with machine learning engineers, data scientists, and ML applications is plus
Familiarity with Angular and other UI frameworks is a plus

We are an Equal Opportunity Employer and do not discriminate against any employee or applicant for employment because of race, color, sex, age, religion, sexual orientation, gender identity, national origin, status as a veteran, and basis of disability or any federal, state, or local protected class.

Additional Locations:

* United States of America - North Carolina - Morrisville

* United States of America

* United States of America - North Carolina

* United States of America - North Carolina - Morrisville

Find Your Dream Job Today

Data Pipeline Engineer

General Information

Why Work at Lenovo

Description and Requirements