Job Description
-Build and own production search & AI services end-to-end (Python APIs, ML inference pipelines [embeddings, transformers, LLMs], and real-time, event-driven systems on GCP)
-Contribute to modern search infrastructure, implementing hybrid retrieval (keyword + vector + reranking), Elasticsearch pipelines, vector databases, and relevance measurement through experimentation and metrics.
-Set a high engineering bar, leading design discussions, enforcing CI/CD and observability best practices, and actively mentoring through reviews and knowledge sharing.
We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to HR@insightglobal.com.To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/.
Required Skills & Experience
•4+ years of professional backend or full-stack engineering experience with a deep Python focus (async patterns, type annotations, testing, and production-grade service design.)
•Proven experience designing and deploying cloud-native applications (GCP strongly preferred; AWS or Azure considered).
•Hands-on experience building resilient microservices and RESTful/gRPC APIs.
•Strong understanding of containerization (Docker), orchestration (Kubernetes), and serverless paradigms.
•Strong grounding in SOLID design principles and software craftsmanship.
•Good communicator who thrives in cross-functional, agile teams alongside ML engineers, architects, and product owners.
•Comfortable using AI tools to accelerate development throughput.
Nice to Have Skills & Experience
•Experience with search platforms such as Elasticsearch, OpenSearch, Solr, or Algolia—index management, query DSL, relevance tuning.
•Familiarity with vector search concepts and tooling such as embeddings, approximate nearest neighbor (ANN), FAISS, Pinecone, Weaviate, or similar.
•Exposure to ML/AI patterns: RAG pipelines, LLM integration, prompt engineering, or fine-tuning workflows.
•Experience with AI orchestration frameworks such as LangChain, LangGraph, or Google ADK.
•Infrastructure-as-code experience (Terraform, Pulumi) and mature CI/CD pipeline ownership.
Benefit packages for this role will start on the 1st day of employment and include medical, dental, and vision insurance, as well as HSA, FSA, and DCFSA account options, and 401k retirement account access with employer matching. Employees in this role are also entitled to paid sick leave and/or other paid time off as provided by applicable law.