INTL - INDIA - SRE System Monitor - 3bced1c5

Post Date

Jul 16, 2025

Location

San Jose, California

ZIP/Postal Code

95134
US
Nov 04, 2025 Insight Global

Job Type

Contract

Category

Help Desk

Req #

SJC-55270ee7-5ad6-45f6-9c61-a408b9a4ef73

Pay Rate

$13 - $16 (hourly estimate)

Job Description

Insight Global is seeking a skilled LLM System Monitor to support the LLM Proxy team. You will monitor and interpret the Grafana dashboards that signal failures and problems, and you will manage incident communication. On a day-to-day basis you will be the SRE watching the observability dashboards. You will either open an incident report yourself in response to an automated alert or be pulled into a chat zone by someone who has created a ticket. From there you will be the main point of contact, communicating clearly with the end customer and the incident commander and giving all parties frequent updates on the status of the incident.

We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to HR@insightglobal.com. To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/.

Required Skills & Experience

-3 years of experience monitoring and responding to incidents on a globally deployed web application (keeping track of permutations)
-Experience working with microservices that run on Kubernetes
-Metrics-forward thought process and a strong understanding of observability tools, with a focus on operational metrics: quantiles, P99, and Prometheus
-Foundational familiarity with AWS services or another cloud provider
-Very strong communication and customer service skills

Nice to Have Skills & Experience

LLM or AI Experience

Benefit packages for this role will start on the 31st day of employment and include medical, dental, and vision insurance, as well as HSA, FSA, and DCFSA account options, and 401k retirement account access with employer matching. Employees in this role are also entitled to paid sick leave and/or other paid time off as provided by applicable law.