Job Description
Insight Global is seeking a for a Backend Engineer with Kubernetes infrastructure knowledge to own the design, automation, and reliability of stateful workloads running on Azure AKS. This role focuses on building highly available, self‑healing systems for databases and storage‑backed services, eliminating manual intervention and “snowflake” configurations while meeting 99.99% uptime targets.
What You’ll Do
-Own the full lifecycle of Kubernetes StatefulSets, including provisioning, scaling, upgrades, and graceful failover
-Design and implement high‑availability architectures using pod anti‑affinity, topology spread constraints, and zonal resilience
-Optimize and tune Azure persistent storage (Premium/Ultra Disks, Azure NetApp Files) via CSI drivers
-Build and automate disaster recovery workflows, including snapshot, restore, and rapid state reconciliation
-Provision and manage stateful infrastructure using Terraform and infrastructure‑as‑code best practices
-Create observability and alerting for PV utilization, disk pressure, replication lag, and storage health
-Ensure cluster upgrades and node rotations happen with zero manual data migration
PR: $37-$45/hr
We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to HR@insightglobal.com.To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/.
Required Skills & Experience
-7+ Years of experience as a software engineer
-2 years of experience hands‑on with Kubernetes StatefulSets, PVCs, and CSI
-Strong experience operating stateful services in production (e.g., Postgres, ClickHouse, Elasticsearch)
-Cloud knowledge- Azure preferred
-Experience automating infrastructure using Terraform
-Strong scripting or development skills in Go, Python, or Bash
Nice to Have Skills & Experience
-Experience building GitOps workflows for complex, ordered deployments
-Background in custom controllers, operators, or lifecycle hooks (PreStop/PostStart)
-Experience with disaster recovery testing, RTO/RPO optimization, and snapshot automation
-Strong observability mindset (dashboards, alerts, SLOs for stateful systems)
-Prior ownership of large‑scale distributed systems on Kubernetes
Benefit packages for this role will start on the 1st day of employment and include medical, dental, and vision insurance, as well as HSA, FSA, and DCFSA account options, and 401k retirement account access with employer matching. Employees in this role are also entitled to paid sick leave and/or other paid time off as provided by applicable law.