Job Description
Insight Global is looking to hire an SRE Lead who functions as a SysOps Administrator in an AWS environment. The ideal candidate will have started their career developing Java-based applications, then advanced to using DevOps tools to deploy into a cloud environment, and most recently held a SysOps role in which they developed and maintained infrastructure. The SRE Lead will join a team of 7 other SRE Leads who provide 24x7 support for time-critical requests across both on-prem and cloud-based (AWS) applications.
Other duties may include:
* Review application architecture and set up monitoring alerts with appropriate SLA thresholds
* Perform application administrator tasks such as user access management, security minimum baseline implementation, onboarding new interfaces, code deployments, application upgrades/patching, and infrastructure maintenance
* Coordinate with multiple teams to document IT resiliency plans and execute Business Continuity procedures for different software
* Ensure high-quality documentation, which may include business requirements, design documents, user manuals, testing deliverables (test plans, cases, and strategies), and user documentation such as help files and how-to manuals
Required Skills & Experience
* AWS Associate Certification -- SysOps Administrator
* 5+ years of SysOps experience configuring and maintaining enterprise-level applications
* Strong technical skills in DevOps Tools -- Jenkins, Git, Docker, Chef, Kubernetes, Terraform
* Good understanding of server, storage, networking, and Well-Architected Framework (WAF) concepts
* Strong development skills creating scripts from scratch for process automation, performance metrics dashboards, and monitoring using PowerShell or Python
* 5+ years of experience using critical thinking to debug complex technical issues independently, driving resolution with internal teams, and documenting Incident, Request, and Change/Release management processes using JIRA and ServiceNow
* Working knowledge of Data Ingestion, Transformation, Warehousing, Machine Learning & data analytics concepts using AWS Data Lakes and Snowflake
* Good understanding of software testing tools such as Selenium, JMeter, JUnit, and PyTest
* Bachelor's degree in Information Technology, Computer Science or related field
* EKS Administration
* Kubernetes certification
* Global team - flexibility needed outside normal business hours (dedicated on-call schedule)
* Strong communication - will have Scrum Master responsibilities, including meeting with other Scrum Masters every 3 weeks to discuss dependencies and progress on completed tasks
Nice to Have Skills & Experience
* DevOps certification
* Experience working with Splunk log indexing, creating usage metrics dashboards and Datadog APM dashboards
* Experience automating manual tasks or creating self-service utilities using software such as Robot Framework
Benefit packages for this role will start on the 31st day of employment and include medical, dental, and vision insurance, as well as HSA, FSA, and DCFSA account options, and 401k retirement account access with employer matching. Employees in this role are also entitled to paid sick leave and/or other paid time off as provided by applicable law.