The Infrastructure team serves as a resource for Engineering, helping diagnose production issues and providing guidance on improving the availability and performance of our applications. This position also develops systems, automation, and tools that make it easier for Engineering teams to deploy services in a fast, automated, and reliable fashion.
Build, scale and support high-availability Ubuntu Linux production and development systems in a public cloud environment.
Work with tools such as Jenkins, Ansible, Argo CD, Terraform, CloudFormation, Resource Manager and many more to ensure that our stack is well represented as Infrastructure as Code.
Manage and improve security and availability monitoring for all services, and ensure that defined security policies are consistently implemented across all environments.
Deploy workloads to multiple cloud environments. Bring proven experience with the core services of AWS, Azure or GCP, including instance management, IAM configuration, databases, caching and general support/troubleshooting.
Have a developed understanding of the core components required to run Kubernetes and be able to build a cluster from scratch if needed.
Have mastered the fundamentals of load balancing and service mesh, and always look for ways to improve availability and uptime.
Maintain quality documentation for systems owned by the Infrastructure team.
Use monitoring tools to identify and resolve potential issues before they become incidents. Be familiar with Prometheus.
Help other teams troubleshoot and resolve failures and performance problems, and participate in on-call rotations.
Have a passion for working with Go, Python, Rust or even Bash to build custom tools and improve system integration. Take code ownership to the next level and act as an advocate for writing code that aligns with industry best practices.
Have a solid grasp of networking fundamentals and be able to easily explain how DNS, DHCP and routing work in most environments.
We are a company committed to creating inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity employer that believes everyone matters. Qualified candidates will receive consideration for employment opportunities without regard to race, religion, sex, age, marital status, national origin, sexual orientation, citizenship status, disability, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to HR@insightglobal.com. The EEOC "Know Your Rights" Poster is available here.
To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/
BS degree in Computer Science or equivalent experience.
3+ years of proven experience with Linux or UNIX systems and related protocols/software.
A command of Linux systems including troubleshooting, memory management, tuning, I/O subsystem, RAID, and security.
Experience with provisioning tools such as Ansible/Chef/Terraform.
Experience with Jenkins or other CI/CD tools.
Programming aptitude in Go, Python, and Bash.
Working knowledge of database systems such as MySQL or PostgreSQL.
Experience building and deploying containers, including orchestration tools such as Kubernetes, Mesos, or Docker Swarm.
Experience with cloud providers (AWS, Azure, GCP).
Benefit packages for this role will start on the 31st day of employment and include medical, dental, and vision insurance, as well as HSA, FSA, and DCFSA account options, and 401k retirement account access with employer matching. Employees in this role are also entitled to paid sick leave and/or other paid time off as provided by applicable law.