Senior Network Reliability Engineer

Post Date

Mar 17, 2026

Location

San Francisco,
California

ZIP/Postal Code

94103
US
May 19, 2026 Insight Global

Job Type

Perm

Category

Network Engineer

Req #

DGW-7abfec37-5d69-40fd-9fd5-e0392d048c40

Pay Rate

$150k - $250k (estimate)

Job Description

Insight Global is seeking a Network Engineer – Reliability & Observability to support the quality, reliability, and lifecycle performance of large-scale AI network infrastructure. This role serves as a reliability engineering leader, responsible for building processes, data collection frameworks, and reliability metrics to improve network performance from initial deployment through ongoing operations.
This position focuses on developing scalable processes, systems, tooling, and data pipelines that drive network observability and reliability. You will deliver automated 24x7 metrics as well as periodic reliability reporting for both internal stakeholders and external customers, ensuring visibility into network health, performance, and risk.
This role is well-suited for experienced network operators who are passionate about reliability engineering and full-lifecycle software development, including quality assurance audits, circuit audits, periodic inspections, failure rate tracking, and root cause analysis. Ideal candidates bring a strong interest in both hardware (electronics and optics) and software development, and consistently leverage data to guide deployment decisions, operational improvements, and strategic sourcing.
Experienced Site Reliability Engineers (SREs) with a strong networking background and a focus on observability and reliability are strongly encouraged to apply.

We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to HR@insightglobal.com.To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/.

Required Skills & Experience

• 5–8+ years’ experience in large scale / hyperscale network operations
• Strong background as an SRE, NRE, or Network Engineer with software focus
• Experience building software for observability (metrics, data stores, dashboards)
• Strong automation and coding skills (Python, SQL; others acceptable)
• Ability to analyze data and say “this is what we need to measure”
• Excellent communication skills — can present technical insights to executives
• Experience working with routers, switches, interface cards

Nice to Have Skills & Experience

• Data science or analytics experience applied to infrastructure
• Experience with network reliability, performance analysis, or capacity planning
• Familiarity with modern observability stacks and custom dashboards
• Background in deployment, operations, or repair environments
• Prior titles such as NRE, Network SRE, Network Architect

Benefit packages for this role will start on the 1st day of employment and include medical, dental, and vision insurance, as well as HSA, FSA, and DCFSA account options, and 401k retirement account access with employer matching. Employees in this role are also entitled to paid sick leave and/or other paid time off as provided by applicable law.