Remote Senior Data Engineer

Post Date

May 06, 2026

Location

Upper Providence,
Pennsylvania

ZIP/Postal Code

19426
US
Aug 15, 2026 Insight Global

Job Type

Contract

Category

Engineering (Non IT)

Req #

PHL-e1d06503-2b5a-4729-90d1-27f6c4df3b83

Pay Rate

$68 - $85 (hourly estimate)

Job Description

The Senior Data Engineer will design, build, and deliver a new enterprise data product supporting the clients generative drug design and computational chemistry platforms. This role focuses on creating scalable, well‑structured data architecture from the ground up, with long‑term expansion and downstream AI/ML integration in mind. The ideal candidate combines strong data engineering expertise with an understanding of drug design, chemistry, and scientific data workflows.

-Design and implement a new enterprise data product, initially scoped as a standalone deliverable with future integration into broader AI‑driven drug discovery platforms.
-Build scalable data pipelines, schemas, and storage models capable of supporting large, complex scientific and chemistry‑derived datasets.
-Develop data solutions primarily on GCP / BigQuery, adhering to enterprise data engineering templates and standards.
-Implement data transformations and pipelines using Python, with a focus on data quality, traceability, and performance.
-Ensure the data architecture supports future expansion, additional datasets, and evolving analytical and computational needs.
-Collaborate closely with computational chemists, data scientists, and ML engineers to ensure data models align with generative design, molecular representations, and ML outputs.
-Apply an understanding of drug design and chemistry concepts (e.g., molecular properties, structure‑activity data, experimental outputs) to inform data modeling and integration decisions.
-Provide technical guidance on data structure, scalability, and long‑term maintainability in an enterprise environment.


The Data Engineer will take end‑to‑end ownership of a new cloud‑native data product on Google Cloud Platform, leveraging established in‑house templates and standards to deliver a robust, scalable, and production‑grade solution. The role involves designing and operating reliable ingestion pipelines for external data sources, integrating and harmonising key fields with parallel external data products, and delivering curated, analytics‑ready datasets that are readable, updatable, and trusted by downstream users. The engineer will apply strong expertise in Python, SQL, columnar data warehouses (e.g. BigQuery), schema and data‑model design, pipeline orchestration, and query optimisation, while embedding best practices around data quality, testing, metadata, and documentation. Operating as part of a cross‑functional environment, the role requires a solid understanding of core cloud concepts, CI/CD, version control, and data reliability in production. Experience with scientific or R&D data—particularly chemistry or related life‑science domains—would be highly advantageous, enabling effective standardisation, interpretation, and integration of complex domain‑specific datasets.

We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to HR@insightglobal.com.To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/.

Required Skills & Experience

-Strong experience in data engineering, including database, schema, and data product design.
-Hands‑on experience with GCP and BigQuery (Postgres familiarity a plus).
-Proficiency in Python for building and maintaining data pipelines.
-CI/CD
-Experience working with large, complex datasets at scale, ideally in scientific or R&D contexts.
-Background in life sciences, pharma, or scientific data platforms.

Nice to Have Skills & Experience

-Database Design
-Experience supporting downstream analytics, ML pipelines, or AI‑driven platforms, particularly in R&D or discovery environments.
-Background in life sciences, pharma, or scientific data platforms.
-Working knowledge or hands‑on exposure to drug design, chemistry, or computational chemistry data.

Benefit packages for this role will start on the 1st day of employment and include medical, dental, and vision insurance, as well as HSA, FSA, and DCFSA account options, and 401k retirement account access with employer matching. Employees in this role are also entitled to paid sick leave and/or other paid time off as provided by applicable law.