Senior Site Reliability Engineer

Post Date

Aug 19, 2025

Location

Phoenix,
Arizona

ZIP/Postal Code

85004
US
Oct 28, 2025 Insight Global

Job Type

Perm

Category

Security Engineering

Req #

DC0-ef16d91d-b785-44a2-b9c0-936efd3abb74

Pay Rate

$150k - $170k (estimate)

Job Description

Ensure Reliability & Performance: Own the observability of our systems, ensuring they meet established service-level objectives (SLOs) and maintain high availability.
Cloud & Container Orchestration: Deploy, configure, and manage resources on Google Cloud Platform (GCP) and Google Kubernetes Engine (GKE), focusing on secure and scalable infrastructures.
Infrastructure Automation & Tooling: Set up and maintain automated build and deployment pipelines; drive continuous improvements to reduce manual work and risks.
Monitoring & Alerting: Develop and refine comprehensive monitoring solutions (performance, uptime, error rates, etc.) to detect issues early and minimize downtime.
Incident Management & Troubleshooting: Participate in on-call rotations; manage incidents through resolution, investigate root causes, and create blameless postmortems to prevent recurrences.
Collaboration with Development: Partner with development teams to design and release services that are production-ready from day one, emphasizing reliability, scalability, and performance.
Security & Compliance: Integrate security best practices into system design and operations; maintain compliance with SOC 2 and other relevant standards.
Performance & Capacity Planning: Continuously assess system performance and capacity; propose and implement improvements to meet current and future demands.

We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to HR@insightglobal.com.To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/.

Required Skills & Experience

Education & Experience:
Bachelor’s degree in Computer Science, Business Administration, or relevant work experience
A minimum of 5+ years in an SRE, DevOps, or similar role in an IT environment, required
System Administration Expertise:
Hands-on experience with Microsoft SQL Clusters, Elasticsearch, Kubernetes, required
Deep familiarity with Windows or Linux environments and .NET or PHP stack applications, including IIS/Apache, SQL Server/MySQL, etc.
Strong understanding of networking, firewalls, intrusion detection, and security best practices
CI/CD & Automation:
Proven administrative experience with tools like GIT, TFS, Bitbucket, and Bamboo for Continuous Integration, Delivery, and Deployment
Knowledge of automation testing tools such as SonarQube, Selenium, or comparable technologies
Observability & Monitoring:
Experience with performance profiling, logging, metrics collection, and alerting tools
Competence in debugging solutions across diverse environments
Cloud & Microservices:
Hands-on experience with GCP, AWS, or Azure, container orchestration (Kubernetes), and microservices-based architectures
Security Acumen:
Understanding of authentication, authorization, OAUTH, SAML, encryption (public/private key, symmetric, asymmetric), token validation, and SSO
Familiarity with security strategies to optimize performance while maintaining compliance (e.g., SOC 2)
On-Call Support:
Willingness to participate in an on-call rotation and respond to system emergencies 24/7 when necessary
Monthly weekend rotation for Production Patching
Certifications (nice to have):
A+, MCP, Dell certifications
Microsoft Office expertise

Nice to Have Skills & Experience

Technical Evangelism: Contribute to cultivating a culture of reliability through training, documentation, and mentorship across the organization.

Benefit packages for this role will start on the 31st day of employment and include medical, dental, and vision insurance, as well as HSA, FSA, and DCFSA account options, and 401k retirement account access with employer matching. Employees in this role are also entitled to paid sick leave and/or other paid time off as provided by applicable law.