Job Description
We are seeking an experienced Site Reliability Engineering (SRE) Manager to lead and scale a high-performing SRE organization. This role will oversee multiple teams, establish governance standards, and drive reliability, performance, and automation initiatives across a complex, multi-division environment.
This is a high-impact leadership role requiring both strong technical depth and proven people management experience. The ideal candidate will help define best practices, evaluate emerging technologies, and ensure consistent execution of SRE principles across the organization.
Key Responsibilities
Lead and manage an SRE organization of ~18–22 engineers, including team leads and sub-teams
Establish and enforce SRE governance frameworks, standards, and best practices across multiple divisions
Drive reliability, scalability, and performance improvements, reducing recurring incidents and production issues
Oversee observability, monitoring, and incident response strategies
Partner with engineering and product teams to align on SLAs, SLIs, and SLOs
Evaluate and implement emerging technologies (e.g., AI-driven monitoring, automation agents)
Ensure consistent adoption of automation and operational excellence practices
Provide leadership in technical decision-making and step in to solve complex production challenges when needed
Mentor and develop team leads and engineers, ensuring accountability and delivery across teams
Support a standardized operating model that can scale across multiple business units
We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to HR@insightglobal.com.To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/.
Required Skills & Experience
10–20+ years of overall IT experience, with deep technical foundation
7–8+ years of hands-on SRE experience in production environments
Proven experience as a people manager (not first-time manager) leading teams at scale
Experience managing large teams (~15–20+ engineers), including layered team structures
Strong expertise in Azure (primary cloud platform)
Working knowledge or exposure to AWS
Experience with .NET / application development fundamentals
Strong understanding of the Software Development Lifecycle (SDLC) including design, testing, and deployment
Experience with observability, monitoring, and incident management frameworks
Ability to define and implement governance models and operational standards
Experience driving cross-team alignment across multiple divisions or business units
Strong leadership skills with ability to mentor leads and influence senior stakeholders
Nice to Have Skills & Experience
Experience implementing AI/ML-driven SRE solutions (e.g., intelligent alerting, automation agents)
Exposure to advanced observability tooling and monitoring pipelines
Experience with PostgreSQL or similar database technologies
Background in both infrastructure and application-focused SRE models
Experience in high-growth or highly complex enterprise environments
Prior experience helping organizations scale SRE practices across multiple teams or geographies
Exposure to platform engineering concepts
Experience working in offshore/global delivery models (e.g., India-based teams)
Benefit packages for this role will start on the 1st day of employment and include medical, dental, and vision insurance, as well as HSA, FSA, and DCFSA account options, and 401k retirement account access with employer matching. Employees in this role are also entitled to paid sick leave and/or other paid time off as provided by applicable law.