Lead MLOps Gen AI Engineer

JOB_REQUIREMENTS

Hires in

Not specified

Employment Type

Not specified

Company Location

Not specified

Salary

Not specified

EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.

We are looking for a Lead MLOps Gen AI Engineer to support the development and deployment of AI tools.

This role bridges data science, engineering, and cloud infrastructure, focusing on building scalable and automated production-grade AI systems. You will work closely with data scientists, business stakeholders, and cloud engineers to convert prototypes into dependable, high-performance applications that drive predictive analytics, generative AI solutions, and interactive dashboards for business leaders. If you have substantial experience in MLOps and cloud infrastructure and enjoy collaborating across teams, we encourage you to apply.

Responsibilities

Collaborate with data scientists to convert ML and generative AI experiments into scalable production pipelines
Develop and maintain shared code repositories and reusable components
Design and implement CI/CD pipelines in AWS or Azure environments for deploying models, APIs, and generative AI tools
Build and manage data pipelines and DataOps processes
Containerize applications using Docker and deploy them with cloud-native services
Automate infrastructure provisioning using Terraform and manage database schemas in Azure and Snowflake
Deploy and administer generative AI applications such as chatbots, retrieval-augmented generation (RAG) systems, and predictive tools
Implement monitoring, observability, and explainability solutions to ensure system stability and performance
Establish alerting, rollback strategies, and observability frameworks to maintain operational stability
Participate in code reviews and recommend improvements to workflows and processes

Requirements

Experience of 8 to 12 years in MLOps or DevOps engineering
Proven leadership experience in managing AI or ML deployment projects
Background in collaborating with data scientists and cloud engineers
Expertise in building CI/CD pipelines in AWS or Azure platforms
Skills in containerization using Docker and deploying applications in cloud environments
Proficiency in infrastructure automation using Terraform
Knowledge of managing data schemas in cloud data warehouses such as Azure and Snowflake
Understanding of generative AI fundamentals and applications
Capability to deploy and manage AI models and APIs in production

Nice to have

Familiarity with large language models (LLM)
Experience with retrieval-augmented generation (RAG) systems

We offer

Opportunity to work on technical challenges that may impact across geographies
Vast opportunities for self-development: online university, knowledge sharing opportunities globally, learning opportunities through external certifications
Opportunity to share your ideas on international platforms
Sponsored Tech Talks & Hackathons
Unlimited access to LinkedIn learning solutions
Possibility to relocate to any EPAM office for short and long-term projects
Focused individual development
Benefit package:
- Health benefits
- Retirement benefits
- Paid time off
- Flexible benefits
Forums to explore beyond work passion (CSR, photography, painting, sports, etc.)

Similar jobs