Job Description
Insight Global is seeking a Data Engineer to support a pharmaceutical client based in West Point, PA. This is a hybrid role, requiring 3 days a week on-site.
The Data Engineer will be a part of the Digital Sciences and Enabling Capabilities team within a greater R&D organization and will help establish data workflows for predictive tools to enable more effective identification, characterization, and development of novel medicines and vaccines. This Data Engineer will impact all aspects of the drug discovery and development pipeline, including projects spanning data workflows, instrument metrology, and predictive sciences. This team helps to enable work across all drug modalities, including small molecule, peptide, biologics, vaccines, and beyond.
We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to HR@insightglobal.com.To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/.
Required Skills & Experience
o Python software development
o Cloud Services – AWS (Lambda Functions, S3, Cloud Formation Templates, RDS, ECR)
o Databases - relational databases, SQL, data modeling and design
o Development of ETL Processes / Data Workflows / Data Pipelines / Data Wrangling / Data Ingestion.
o Software design, development, and testing (unit testing and system testing)
o Version control - Git, GitHub
o CI/CD - GitHub Actions
o File Formats (XLXS, YAML, JSON, CSV, TSV)
Nice to Have Skills & Experience
Life Science industry expo Cloud Services – AWS (SQS, DLQ, SNS, EventBridge, API Gateway)
o Development of ETL Processes / Data Workflows / Data Pipelines / Data Wrangling / Data Ingestion.
Python packages (Cerberus, PyYAML, logging)
Python linters and type hints; regular expressions
o Experience with data pipeline tools such as Dataiku or Trifacta
Benefit packages for this role will start on the 31st day of employment and include medical, dental, and vision insurance, as well as HSA, FSA, and DCFSA account options, and 401k retirement account access with employer matching. Employees in this role are also entitled to paid sick leave and/or other paid time off as provided by applicable law.