Insight Global is looking for a Site Reliability Engineer to help triage cloud based applications within a DevOps environment.
Run the production environment by monitoring availability and taking a holistic view of system health
Support the applications with OnCall rotation support.
Provide stability to our applications and facilitates rapid feature development by taking active control on direction of the service and be proactive
Automate and eliminate manual work and look for opportunities for automation
Maintaining and implementing the SLO implementation adoption and automation
Production Readiness/Health Scoring & Error Budget Tracking
Runbook standards, maintenance, and updates
We are a company committed to creating inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity employer that believes everyone matters. Qualified candidates will receive consideration for employment opportunities without regard to race, religion, sex, age, marital status, national origin, sexual orientation, citizenship status, disability, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to
Human Resources Request Form. The EEOC "Know Your Rights" Poster is available
here.
To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy:
https://insightglobal.com/workforce-privacy-policy/ .
Experience using DevOps tools and technologies such as GitLab, and Infrastructure as Code tools such as Terraform
Strong troubleshooting skills and building and enhancing the observability using monitoring tools
Proactive approach to Observability maturity, identifying problems, performance bottlenecks, and areas for improvement for observability
Leading incident response and supporting application teams. Blameless postmortems
Developer feedback for enhanced logging, runbooks and addressing technical debt. Promoting observability best practices
Experience in monitoring tools Dynatrace & Splunk
Experience in public cloud platforms, preferably AWS and Api gateways
Experience developing API or Microservices or Frontend is a plus
Experience using source version control (SVC) such as Git
Benefit packages for this role will start on the 31st day of employment and include medical, dental, and vision insurance, as well as HSA, FSA, and DCFSA account options, and 401k retirement account access with employer matching. Employees in this role are also entitled to paid sick leave and/or other paid time off as provided by applicable law.