Senior AI DevOps/SRE
Company
EPAM Systems
Location
Rockaway, NJ
Type
Full Time
Job Description
We are currently seeking an experienced Senior AI DevOps/SRE to join our team.
In this pivotal role, you will collaborate closely with data scientists and software developers to ensure seamless integration and optimize the operational efficiency of our AI deployments. Your expertise will be pivotal in deploying, maintaining, and scaling our cutting-edge AI solutions, encompassing LLMs and RAG systems.
As a key team member, you will spearhead both traditional DevOps responsibilities and innovative approaches to MLOps.Your proactive involvement will be essential in driving the success of our AI initiatives and maximizing their impact across the organization.
Unlock the potential of remote work in Kyrgyzstan, giving you the flexibility to work from home or access our office in Bishkek.
Want more jobs like this?
Get jobs in Rockaway, NJ delivered to your inbox every week.
#LI-DNI#wca-senior-lead-ai-devops#Big-Data-6-KG#AI-Integration-vacancies-KG#May-Referral-Digest-KG
Responsibilities
- Implement and maintain CI/CD pipelines for AI and machine learning projects, ensuring robust deployment strategies and continuous integration
- Monitor and ensure the reliability, availability, and performance of AI applications, particularly those involving LLMs and RAG
- Collaborate with AI research teams to operationalize machine learning models and systems efficiently
- Develop and enforce best practices for version control, configuration management, and testing of AI-driven software solutions
- Utilize MLOps tools such as Kubeflow, MLflow, or TensorFlow Extended (TFX) to streamline the machine learning lifecycle from experimentation to production
- Implement monitoring solutions that track both system metrics and model performance to facilitate proactive issue resolution
- Participate in on-call rotations to support the operational health of critical systems, employing SRE principles to meet service-level objectives (SLOs) and reduce downtime
- Bachelor's degree in Computer Science, Engineering, or a related field
- Proven experience as a DevOps Engineer or SRE, with a strong background in software development and automation
- Expertise in deployment and management of LLMs, including technologies like RAG
- Proficient in CI/CD tools (Jenkins, GitLab CI, CircleCI) and infrastructure as code (Terraform, Ansible)
- Solid knowledge of container orchestration technologies (Kubernetes, Docker)
- Familiarity with MLOps tools and practices to support machine learning lifecycle management
- Experience with cloud services (AWS, GCP, Azure), particularly in AI/ML deployments
- Background in monitoring tools like Prometheus, Grafana, and ELK stack
- Understanding of Python, particularly in data science and machine learning contexts
- Certification in Kubernetes, AWS/GCP/Azure, or similar technologies
- We connect like-minded people::
- Delivering innovative solutions to industry leaders, making a global impact
- Enjoyable working environment, whether it is the vibrant office or the comfort of your own home
- Opportunity to work abroad for up to two months per year
- Relocation opportunities within our offices in 55+ countries
- Corporate and social events
- We invest in your growth::
- Leadership development, career advising, soft skills and well-being programs
- Certifications, including GCP, Azure and AWS
- Unlimited access to LinkedIn Learning, Get Abstract, O'Reilly
- Free English classes with certified teachers
- We cover it all::
- Monetary bonuses for engaging in the referral program
- Medical & family care package
- Six trust days per year (sick leave without a medical certificate)
- Coverage of psychology sessions of your choice
- Discounts for fitness clubs and sports programs
- Benefits package (sports activities, a variety of stores and services)
Date Posted
12/21/2024
Views
0
Similar Jobs
Pharmacy Technician / Pharm Tech Apprenticeship - Walgreens
Views in the last 30 days - 0
Walgreens is transforming its pharmacy technician roles into a more patientcentric environment As a Walgreens Pharmacy Technician or Apprentice youll ...
View DetailsDirector, Brand Creative Services - Ridgefield Park, NJ - Samsung Electronics America
Views in the last 30 days - 0
Samsung is seeking a Creative Director for a US leadership role responsible for creative strategies and initiatives to accelerate Samsungs brand marke...
View DetailsSummer Intern, Training & User Experience - The Port Authority of New York & New Jersey
Views in the last 30 days - 0
TEC Solutions is offering a 12week fulltime internship starting May 29 2025 The ideal candidate should have passion resourcefulness and curiosity Resp...
View DetailsSenior Manager, Fulfillment Engineering - Rent the Runway
Views in the last 30 days - 0
Rent the Runway RTR is a pioneering company in the fashion industry offering a circular fashion model through subscription rental or ownership Founded...
View DetailsResearch Scientist - Cleaning Verification Specialist - Thermo Fisher Scientific
Views in the last 30 days - 0
Thermo Fisher Scientific is seeking a Senior Scientist for their Analytical Regulated Testing GMP team The role involves performing pharmaceutical man...
View DetailsAdministrative Assistant (Temp to Perm) - Lord Abbett
Views in the last 30 days - 0
Lord Abbett an independent investment management firm founded in 1929 is seeking a highly motivated Administrative Assistant The role involves providi...
View Details