Job Description
We are seeking a highly skilled and experienced Senior Data Scientist to join our dynamic team. In this role, you will leverage your expertise in data science, machine learning, and natural language processing (NLP) to design, implement, and optimize ML models. You will work with large-scale data systems and streaming data environments to deliver impactful solutions.
We are a company committed to creating inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity employer that believes everyone matters. Qualified candidates will receive consideration for employment opportunities without regard to race, religion, sex, age, marital status, national origin, sexual orientation, citizenship status, disability, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to
Human Resources Request Form. The EEOC "Know Your Rights" Poster is available
here.
To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy:
https://insightglobal.com/workforce-privacy-policy/ .
Required Skills & Experience
5+ years of experience in applied data science or ML roles, including using Python and NLP and LLM implementation
5+ years of experience with data exploration, data cleaning, data analysis, data visualization, or data mining
Experience with production-level systems, data lake environments, and streaming data, including Kafka
Experience implementing end-to-end ML workflows from data prep to deployment and evaluation
Ability to quickly learn infrastructure or systems concepts, including how pipelines interface with data lakes
Ability to design, implement, and iterate on ML models for document classification, extraction, summarization, and search
Ability to take ownership of data science workflows that interact with a production system streaming millions of documents per week
TS/SCI clearance
Bachelor's degree
Nice to Have Skills & Experience
Experience in collaborating with MLOps and infrastructure engineers to ensure robust model deployment, monitoring, and retraining pipelines
Experience supporting platform components such as documents indexing or search, GPU workloads, and distributed storage, including Cloudera
Experience in the development of algorithms leveraging R, Python, SQL, or NoSQL
Experience with Distributed data or computing tools, including MapReduce, Hadoop, Hive, EMR, Spark, Gurobi, or MySQL
Experience with visualization packages, including Plotly, Seaborn, or ggplot2
Benefit packages for this role will start on the 31st day of employment and include medical, dental, and vision insurance, as well as HSA, FSA, and DCFSA account options, and 401k retirement account access with employer matching. Employees in this role are also entitled to paid sick leave and/or other paid time off as provided by applicable law.