DevOps Engineer-AI Distributed Platform

Samsung SDS America · South Bay

Company

Samsung SDS America

Location

South Bay

Type

Full Time

Job Description

Samsung SDS America is looking for a DevOps Engineer who has experience in writing Kubernetes services in Python language. We are at the forefront of innovation in creating intelligent and interactive machines. Samsung's perspective on Artificial Intelligence is to build an ecosystem that is user-centric rather than device-centric.

We work on distributed deep learning at scale across hundreds of GPU nodes to accelerate and automate deep learning workflows in a Kubernetes environment - on premises and in the cloud. We specialize in making deep neural networks work best on GPU High Performance Computing clusters; we love to work on scaling, breakthrough performance and record shattering benchmarks.

Samsung SDS is the digital arm of the Samsung group and a global provider of cloud and digital transformation innovations. Samsung SDS delivers enterprise-grade solutions and services in cloud, secure mobility, analytics / AI, digital marketing and digital workspace. We enable our customers in government, financial services, healthcare, and other industries to drive business in a hyper-connected economy helping them to increase productivity, safeguard assets, and make smarter decisions.

Responsibilities:

  • Collaborate with internal teams to design, deploy, configure and maintain Kubernetes clusters to support AI workloads.
  • Maintain and Enhance our Library API for Kubernetes.
  • Maintain and Enhance our container based server application invoking our Kubernetes Library.
  • Serve as our main Kubernetes Expert for all items related to Kubernetes.
  • Debug and Troubleshoot our Kubernetes based cluster application.
  • Become an expert in our Kubernetes application deployment written in Bash Scripting.
  • Build a new Python based Kubernetes Application Deployment Tool.
  • Enhance Production Debugging Capability (such as run time traces) of our Kubernetes Application.
  • Write server side code for Kubernetes and NVIDIA GPU business logic as REST API in Python.
  • Deploy new Kubernetes cluster and applications.
  • Manage and maintain CI/CD pipelines for various software applications.
  • Write distributed server framework.
  • Write code in a manner that does not fail in production.
  • Above all, deliver very high-quality code that can be maintained in production.

Requirements

  • Bachelor's degree in Computer Science, or a related field.
  • 5+ years of experience of writing commercial server side software is required.
  • 3+ years of Kubernetes Experience is required.
  • 3+ years of Python programming Experience is required. An equivalent experience with go language may be substituted.
  • Kubernetes deployment and hands on experience is required.
  • Prior experience of authoring Kubernetes internals, scheduling, and plugins.
  • Strong experience with cloud infrastructure platforms such as AWS, Azure, or Google Cloud Platform.
  • Strong experience with containerization technologies such as Docker and Kubernetes.
  • Prior experience with configuration management tools such as Ansible, Chef, or Puppet.
  • Experience deploying and maintaining microservices architectures.
  • Experience with Distributed System Design.

Preferred Qualifications:

  • Prior Experience with Dockers.
  • Prior Experience with TensorFlow and PyTorch frameworks.
  • Prior experience with Artificial Intelligence and Machine Learning.

The base pay range for this role is USD $157,000 - $195,000 per year.

Benefits

Samsung SDSA offers a comprehensive suite of programs to support our employees:

  • Top-notch medical, dental, vision and prescription coverage
  • Wellness program
  • Parental leave
  • 401K match and savings plan
  • Flexible spending accounts
  • Life insurance
  • Paid Holidays
  • Paid Time off
  • Additional benefits

Samsung SDS America, Inc. is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, race, color, religion, sex, sexual orientation, gender identity or expression, national origin, disability, status as a protected veteran, marital status, genetic information, medical condition, or any other characteristic protected by law.

Date Posted

06/26/2023

Views

0

Back to Job Listings Add To Job List Company Profile View Company Reviews
Positive
Subjectivity Score: 0.8

Similar Jobs

AI Solution Manager, ServiceNow Platform - ServiceNow

Views in the last 30 days - 0

ServiceNow a global market leader in AIenhanced technology is seeking an AI Solution Manager to lead the implementation of AI solutions for complex bu...

View Details

Staff Flight Test Engineer - Wisk

Views in the last 30 days - 0

Wisk Aero is seeking a Staff Flight Test Engineer to join their team in Hollister CA The role involves ensuring safe and efficient flight testing and ...

View Details

Senior Developer, Data Engineer - Tarana Wireless, Inc.

Views in the last 30 days - 0

Tarana is seeking a Senior DeveloperData Engineer with 5 years of experience in building largescale data pipelines The role involves designing buildin...

View Details

Staff Engineer, System Design Verification Engineering - Western Digital

Views in the last 30 days - 0

Western Digital is seeking a validation engineer to define and track test plans characterize and optimize SSDs and lead bug review meetings The ideal ...

View Details

Servo Development Engineer - Western Digital

Views in the last 30 days - 0

Western Digital a company with over 50 years of experience in data storage is seeking a skilled professional to optimize highperformance and robust po...

View Details

Senior Front-End Software Engineer - Percipient.ai

Views in the last 30 days - 0

Percipientai founded in 2017 is a cuttingedge technology company specializing in Computer Vision Artificial Intelligence and Deep Learning They develo...

View Details