Job Description
AT&T is seeking a highly skilled and motivated Observability & Cloud Infrastructure Engineer to lead the migration from Azure Log Analytics to an open-source observability solution based on OpenSearch. This role will drive the design, implementation, and management of centralized OpenSearch clusters within Azure Kubernetes environments, while introducing new logging infrastructure and ensuring secure, high-performance operations across Azure and AWS platforms.
• Lead the migration of logging infrastructure from Azure Log Analytics to OpenSearch.
• Design, implement, and manage centralized OpenSearch clusters in Azure Kubernetes Service (AKS).
• Administer Kubernetes clusters in both on-prem and cloud environments.
• Develop and optimize logging pipelines using FluentBit, Kafka, Event Hub, and OpenSearch.
• Introduce and scale new observability infrastructure for multi-tenant environments.
• Collaborate with external vendors for support and troubleshooting.
• Manage compute and storage resources across Azure and AWS, focusing on cost-efficiency and performance.
• Implement private endpoints and secure networking between Azure and AWS environments.
• Monitor and maintain infrastructure to ensure high availability, scalability, and reliability.
We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to HR@insightglobal.com.To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/.
Required Skills & Experience
• Strong hands-on experience with Kubernetes (on-prem and cloud), OpenSearch/ElasticSearch, and Azure.
• Deep understanding of Azure Log Analytics and its role in observability pipelines.
• Proficiency in cloud platforms and networking integration.
• Solid understanding of data pipeline tools such as Kafka, Event Hub, FluentBit, and DataPrepper.
• Experience with observability tools and multi-tenant logging solutions.
• Strong scripting skills in Python or similar languages.
• Experience with infrastructure-as-code tools such as Terraform, HELM, and Ansible.
• Proven ability to administer Kubernetes clusters and manage infrastructure.
• Excellent analytical, problem-solving, and debugging skills.
• Strong communication and collaboration abilities.
• Ability to work independently and drive solutions with minimal oversight.
Nice to Have Skills & Experience
• Proven experience in observability engineering, data pipeline management, cloud infrastructure optimization, and solution architecture.
• Familiarity with ElasticSearch as an alternative or complement to OpenSearch.
• Experience working in hybrid cloud environments and with cross-functional teams.
Benefit packages for this role will start on the 31st day of employment and include medical, dental, and vision insurance, as well as HSA, FSA, and DCFSA account options, and 401k retirement account access with employer matching. Employees in this role are also entitled to paid sick leave and/or other paid time off as provided by applicable law.