Job Description
Insight Global is seeking a Senior Data Scientist / Engineer with a strong background in data infrastructure and large-scale data environments to support U.S. Space Command mission data initiatives.
This role supports the definition, architecture, and engineering of a large-scale on-premises data environment designed to ingest, store, process, and distribute space domain data across multiple classification levels.
The selected candidate will work closely with requirements engineers, system architects, and government stakeholders to define technical solutions for a mission-scale data platform supporting 30+ petabytes of data and scaling to 200–500 racks of compute, storage, and network infrastructure.
This is not a traditional data science role. The position sits at the intersection of:
Data engineering
Data architecture
Infrastructure engineering
Systems engineering
and will play a key role in shaping the technical direction, requirements development, and acquisition strategy for mission data systems.
This role focuses on designing and supporting a large-scale data environment that will process and manage significant volumes of space domain data. The system is expected to scale to tens of petabytes of data and hundreds of infrastructure racks, supporting advanced analytics and mission operations.
The ideal candidate has experience working with large data environments, distributed systems, or high-performance computing platforms, and understands how data platforms are designed from both a software and infrastructure perspective.
This position offers the opportunity to support critical national security missions while working on complex, large-scale data and infrastructure challenges.
We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to HR@insightglobal.com.To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/.
Required Skills & Experience
-Bachelor’s degree in computer science, data science, engineering, or related quantitative field, and a current Top Secret/Sensitive Compartmented Information (TS/SCI) security clearance
-Minimum 7 years of progressive experience across both data engineering and data science disciplines, including expert proficiency in Python and Structured Query Language (SQL); demonstrated success designing Extract, Transform, Load/Extract, Load, Transform (ETL/ELT) pipelines; developing and deploying machine learning models; and working with big data technologies such as Apache Spark, Apache Airflow, or Apache Hadoop
-Demonstrated knowledge of enterprise data architecture and storage hardware (e.g. Storage Area Networks (SAN), Network Attached Storage (NAS), and object storage), data center design principles, data governance practices, and understanding of space data characteristics, including experience with the National Space Intelligence Center (NSIC) Data Catalog
-Develop architecture for large-scale data platforms supporting petabyte-scale storage and analytics workloads
-Define infrastructure requirements including:
Support planning for scalable environments r--aranging from 200–500 racks
-Design distributed data environments including:
Translate mission needs into technical system requirements and design artifacts
-Support capacity planning, including threshold vs objective growth models
-Assist with OTA, RFP, and acquisition planning activities
-Evaluate vendor solutions and participate in technical reviews
-Collaborate across cybersecurity, network, infrastructure, and mission teams
-Support development of architecture documentation and engineering packages
Nice to Have Skills & Experience
-Master’s degree or Doctor of Philosophy (PhD) in computer science, engineering, data science, or related field
-Experience designing or supporting large-scale data environments (petabyte-scale)
-Strong understanding of data center infrastructure and architecture
-Experience with distributed data systems and high-performance computing (HPC) environments
-Knowledge of data storage architectures (object storage, distributed systems)
-Experience with data ingestion and large-scale analytics pipelines
-Familiarity with DoD or Intelligence Community environments
-Understanding of multi-classification or cross-domain architectures
-Experience supporting technical requirements development and acquisition processes
-Knowledge of data center and telecommunications standards (TIA-942, UFC guidance)
Benefit packages for this role will start on the 1st day of employment and include medical, dental, and vision insurance, as well as HSA, FSA, and DCFSA account options, and 401k retirement account access with employer matching. Employees in this role are also entitled to paid sick leave and/or other paid time off as provided by applicable law.