GPU Infrastructure Solutions Architect

Post Date

Feb 05, 2026

Location

South San Francisco, California

ZIP/Postal Code

94080
US
Apr 13, 2026 Insight Global

Job Type

Perm

Category

Architect

Req #

DGO-2ab8304f-7295-4b82-aaf3-50d310a9e89c

Pay Rate

$200k - $250k (estimate)

Job Description

We’re partnering with a fast‑growing technology company building large‑scale compute infrastructure used to power advanced AI workloads. They’re hiring a Solutions Architect (GPU Infrastructure) to work directly with customers to design, deploy, and support high‑performance GPU cluster environments.

In this role, you will gather customer requirements, recommend infrastructure designs, and help implement production‑ready systems for distributed workloads. You’ll also support ongoing operations by troubleshooting issues across hardware, networking, drivers, and software, and by improving reliability through automation, monitoring, and documentation.

This is a hands‑on, customer-facing role for someone who enjoys solving complex infrastructure problems and delivering stable, high‑performance systems.

** Must sit onsite in San Francisco, CA. **

Annual Salary Target: $200K - $250K (exact compensation may vary based on several factors, including skills, experience, and education).

We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to HR@insightglobal.com. To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/.

Required Skills & Experience

- 3+ years of hands‑on experience supporting GPU clusters and/or HPC environments in production
- Experience deploying or operating cluster orchestration/scheduling tools such as SLURM and/or Kubernetes in production environments
- Strong background in infrastructure automation using tools like Terraform and/or Ansible
- Proven ability to troubleshoot high‑performance networking, ideally InfiniBand (configuration + debugging)
- Working knowledge of the NVIDIA GPU software stack (drivers/CUDA) and comfort troubleshooting install/runtime issues
- Strong Linux fundamentals and scripting skills (Python and/or Bash)
- Experience working directly with customers/partners: gathering requirements, presenting recommendations, and owning technical outcomes
- Ability to participate in an on‑call rotation supporting critical production environments

Nice to Have Skills & Experience

- Experience supporting very large clusters (e.g., 1,000+ GPUs) or high‑density compute deployments
- Experience tuning performance end‑to‑end (from Linux kernel/system settings to GPU/CUDA performance)
- Exposure to distributed training or large‑scale AI frameworks (e.g., DeepSpeed, Megatron‑LM, PyTorch FSDP)
- Familiarity with GPU platforms such as DGX/HGX/SuperPOD or equivalent high‑performance designs
- Exposure to non‑NVIDIA accelerators (AMD MI300, Intel Gaudi)
- Contributions to open‑source HPC/AI infrastructure projects

Benefit packages for this role will start on the 1st day of employment and include medical, dental, and vision insurance, as well as HSA, FSA, and DCFSA account options, and 401k retirement account access with employer matching. Employees in this role are also entitled to paid sick leave and/or other paid time off as provided by applicable law.