Job Description
We are seeking an experienced Site Reliability Engineer (SRE) with deep expertise supporting Azure Virtual Desktop (AVD) environments at an enterprise scale. This individual will be responsible for ensuring the reliability, performance, observability, and cost efficiency of a large AVD platform supporting 600+ users across shared compute environments.
Key Responsibilities:
Own the reliability, availability, and performance of enterprise‑scale Azure Virtual Desktop (AVD) environments supporting 600+ users.
Deploy, maintain, and optimize shared pooled compute models.
Proactively monitor, troubleshoot, and resolve AVD platform issues across compute, networking, storage, and identity.
Implement observability (monitoring, logging, alerting) and lead incident response and root-cause analysis.
Drive automation and continuous improvement to reduce manual effort and increase stability.
Manage AVD cost optimization, balancing performance, reliability, and cloud spend.
Support FSLogix profile management and integrations with Zscaler and enterprise security standards.
Perform ongoing platform maintenance, patching, and lifecycle management.
We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to HR@insightglobal.com.To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/.
Required Skills & Experience
5+ years of experience in an SRE role supporting Azure Virtual Desktop in large-scale, enterprise environments.
Proven experience managing environments with large-scale users across shared compute models.
Hands-on experience with FSLogix (profiles, storage performance, troubleshooting).
Experience with Zscaler cloud integrations in enterprise desktop or cloud environments.
Strong troubleshooting experience in Azure, including compute, networking, storage, and identity.
Proficiency with PowerShell and scripting for automation and operational efficiency.
Experience implementing monitoring, logging, and alerting in Azure-based environments.
Strong understanding of reliability engineering principles, operational excellence, and incident management.
Benefit packages for this role will start on the 1st day of employment and include medical, dental, and vision insurance, as well as HSA, FSA, and DCFSA account options, and 401k retirement account access with employer matching. Employees in this role are also entitled to paid sick leave and/or other paid time off as provided by applicable law.