Job Description
The Senior Data Scientist will lead the design and deployment of advanced Retrieval-Augmented Generation (RAG) systems and Agentic AI architectures tailored for legal and regulatory intelligence platforms. This role will drive innovation in enterprise-scale AI systems that power legal research, case law summarization, statutory interpretation, document intelligence, and workflow automation for legal professionals. The ideal candidate combines deep expertise in machine learning, NLP, LLM systems engineering, and distributed AI architectures with some familiarity with legal data ecosystems.
Key Responsibilities
• Lead the architecture and development of RAG-based systems to enhance information retrieval and knowledge synthesis from complex legal datasets
• Design and implement Agentic AI frameworks capable of multi-step reasoning, task orchestration, and autonomous decision-making
• Develop and optimize LLM-powered applications, including fine-tuning, prompt engineering, and evaluation pipelines
• Build scalable machine learning and NLP solutions for document classification, summarization, entity extraction, and semantic search
• Collaborate cross-functionally with engineering, product, and domain experts to translate business needs into scalable AI solutions
• Architect and deploy distributed AI systems in cloud environments (AWS, Azure, GCP)
• Establish best practices for model performance monitoring, evaluation, and governance
• Stay current on advancements in AI/ML, LLMs, and legal tech, and drive innovation within the organization
We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to HR@insightglobal.com. To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/.
Required Skills & Experience
• 8+ years of experience in data science, machine learning, or AI engineering
• Strong expertise in Natural Language Processing (NLP) and Large Language Models (LLMs)
• Hands-on experience designing and deploying RAG pipelines and vector search systems (e.g., FAISS, Pinecone, Weaviate)
• Experience building or integrating Agentic AI systems or multi-agent frameworks
• Proficiency in Python and ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face)
• Experience with cloud platforms (AWS, Azure, or GCP) and distributed computing
• Strong understanding of data pipelines, APIs, and scalable system design
• Excellent problem-solving skills and ability to work in a fast-paced, collaborative environment
Nice to Have Skills & Experience
• Experience working with legal, regulatory, or compliance datasets
• Familiarity with knowledge graphs, ontologies, or semantic data modeling
• Background in enterprise SaaS or data platform environments
• Exposure to governance, risk, and compliance (GRC) workflows
Benefit packages for this role will start on the 1st day of employment and include medical, dental, and vision insurance, as well as HSA, FSA, and DCFSA account options, and 401(k) retirement account access with employer matching. Employees in this role are also entitled to paid sick leave and/or other paid time off as provided by applicable law.