Data Engineer

Post Date

Mar 28, 2025

Location

Houston,
Texas

ZIP/Postal Code

77204
US
Sep 19, 2025 Insight Global

Job Type

Contract

Category

Data Warehousing

Req #

HOU-772504

Pay Rate

$52 - $65 (hourly estimate)

Job Description

An Insight Global client in the Houston area is looking for a Data Engineer to join their team for a contract opportunity. The client is seeking a highly skilled professional with experience in designing and implementing large-scale Data Lakes and Data Warehouses in the cloud. The ideal candidate will have expertise in using one or more query languages (e.g., SQL, HiveQL, SPARQL), schema definition languages (e.g., DDL, SDL, XSD, RDF), and scripting languages (e.g., Python, Scala) to build robust data solutions. A strong understanding of distributed system concepts from both storage and compute perspectives is essential, including data persistence solutions (e.g., HDFS vs RDBMS), data integration techniques (e.g., ETL vs federation), database optimization (e.g., partitioning, distribution, indexing), and join algorithms (e.g., hash vs nested loop). Additionally, experience in CI/CD automation for a data lake using CDK (Cloud Development Kit) or Terraform is highly desirable.

We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to HR@insightglobal.com.

To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/ .

Required Skills & Experience

(Preferably 3+ years) Experience in designing, implementing large scale Data Lake and Data Warehouse in the cloud

Experience in using one or more query languages (e.g. SQL, HiveQL, SPARQL), schema definition languages (e.g. DDL, SDL, XSD, RDF), and scripting languages (e.g. Python, Scala) to build a data solution

Understanding of distributed system concepts from storage and compute perspectives, including data persistence solutions (e.g. HDFS vs RDBMS), data integration techniques (e.g. ETL vs federation), database optimization (e.g. partitioning, distribution, indexing), and join algorithms (e.g. hash vs nested loop)

Experience in CI/CD automation for a data lake using CDK (Cloud Development Kit) or Terraform

Experience with healthcare data formats such as HL7/FHIR, genomics data, and medical imaging data - protocols pulling data from healthcare systems

Nice to Have Skills & Experience

Understanding of healthcare infrastructure and security requirements

Understanding of database and analytical technologies in the industry including MPP and NoSQL databases, Data Warehouse design, and BI Ops

Benefit packages for this role will start on the 31st day of employment and include medical, dental, and vision insurance, as well as HSA, FSA, and DCFSA account options, and 401k retirement account access with employer matching. Employees in this role are also entitled to paid sick leave and/or other paid time off as provided by applicable law.