INTL- AI Operations Manager (Colombia)

Post Date

Feb 21, 2025

Location

Plano,
Texas

ZIP/Postal Code

75024
US
May 13, 2025 Insight Global

Job Type

Contract-to-perm

Category

Managerial / Professional

Req #

DAL-764463

Pay Rate

$30 - $37 (hourly estimate)

Job Description

An employer is looking for an AI Operations Manager to sit remotely. Your primary focus will be to lead an AI operations and support team responsible for ensuring the reliability, scalability, and efficiency of AI/ ML systems in production. You will oversee the 24/7 operations team, develop playbooks, establish/ track SLAs, and collaborate closely with data scientists, AI engineers, and machine learning engineers to define and maintain support strategies. You will build/ lead the operations team, establish staffing plans/ schedules/ rotations and build a team culture. You will design/ implement operational playbooks for AI system monitoring, troubleshooting, and issue resolution. You will lead incident response efforts, ensuring timely resolution and continuous improvement. Additional tasks will include leading the efforts to onboard new clients/ solutions, establishing new processes, build/ provide training materials/ documentation and maintain monitoring systems to proactively detect performance degradation and system failures.

We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to HR@insightglobal.com.

To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/ .

Required Skills & Experience

5+ years experience in AI/ ML operations, MLOps
2+ years experience in leadership roles over related teams
Strong background in AI/ ML systems, data pipelines, monitoring frameworks
Experience building playbooks/ runbooks and establishing SLAs, KPIs and metrics
Proficient in monitoring tools/ platforms (Prometheus, Grafana, Datadog, PagerDuty)
Familiar with cloud platforms (AWS, GCP, Azure) and container orchestration tools (Kubernetes, Docker)

Nice to Have Skills & Experience

Scripting experience (Python, Bash)
Knowledge on machine learning lifecycle management tools (MLflow, SageMaker, Kubeflow)

Benefit packages for this role will start on the 31st day of employment and include medical, dental, and vision insurance, as well as HSA, FSA, and DCFSA account options, and 401k retirement account access with employer matching. Employees in this role are also entitled to paid sick leave and/or other paid time off as provided by applicable law.