Remote GCP Data Engineer

Post Date

Feb 06, 2024

Location

Dearborn, Michigan

ZIP/Postal Code

48124

Job Type

Contract

Job Description

The Ford Public Charging Data team is seeking a Data Engineer to create, deliver, and support custom data products and to enhance and expand the team's capabilities. The engineer will analyze and manipulate large datasets supporting public charging locations. The focus will be on understanding available data, working with business partners to collect requirements, ensuring compliance with Data Governance and OGC use case approvals, creating data products, productionizing delivery, profiling, monitoring, and ongoing support. The engineer will expand their business knowledge, cross-functional collaboration, business intelligence acumen, and technical experience.

Responsibilities

* Develop EL/ELT/ETL pipelines using Python/PySpark to make data from disparate batch and streaming sources available in the BigQuery analytical data store for the Charging / Energy Analytics Product Line.
* Orchestrate workflows using Astronomer (Airflow) that execute ETL data pipelines on a schedule.
* Work with cloud data sources (GCP), understand the data model and the business rules behind the data, and build data pipelines (with GCP) for one or more business domains.
* Build cloud-native services and APIs to support and expose data-driven solutions.
* Partner closely with our data scientists to ensure the right data is made available in a timely manner to deliver compelling and insightful solutions.
* Design, build, and launch shared data services to be leveraged by the internal and external partner developer community.
* Build scalable data pipelines and choose the right tools for the right job. Manage, optimize, and monitor data pipelines.
* Provide extensive technical and strategic advice and guidance to key stakeholders around data transformation efforts. Understand how data is useful to the enterprise.

Required Skills & Experience

Skills Required:

Comfortable with a broad array of relational and non-relational databases. Proven track record of building applications in a data-focused role (cloud and traditional data warehouse). BigQuery, Astronomer, Python, Airflow, Dataproc.

Skills Preferred:

Terraform, GCP cloud services, Pub/Sub, Vertex AI, MLflow

Experience Required:

* 7+ years of experience with SQL and Python
* 2+ years of experience with GCP or AWS cloud services; strong candidates with 5+ years in a traditional data warehouse environment (ETL pipelines) will be considered
* 3+ years of experience building out data pipelines from scratch in a highly distributed and fault-tolerant manner

Benefit packages for this role will start on the 1st day of employment and include medical, dental, and vision insurance, as well as HSA, FSA, and DCFSA account options, and 401k retirement account access with employer matching. Employees in this role are also entitled to paid sick leave and/or other paid time off as provided by applicable law.