Sr ML Engineer - Distributed Systems

NinjaTech AI · Peninsula

Company

NinjaTech AI

Location

Peninsula

Type

Full Time

Job Description

We are NinjaTech AI: a new paradigm shift in human to machine conversational AI company focused on making people at work more productive. We use generative AI and our own LLM models and we are based out of Palo Alto, CA. The company is founded former Google & Meta executives/entrepreneurs with 30+ years of combined experience.

We are looking for a Machine Learning (ML) Engineer to help us create artificial intelligence products. Machine Learning Engineer responsibilities include creating machine learning models and retraining systems. To do this job successfully, you need exceptional skills in statistics and programming. If you also have knowledge of data science and software engineering, we'd like to meet you. Your ultimate goal will be to shape and build efficient self-learning applications.

Responsibilities

  • Propose and Develop distributed systems to scale our ML services (Training & Inference)
  • Apply ML and optimization technique to production systems to scale and improve performance of Natural Language systems in production
  • Develop ML and optimization models in a full lifecycle, from offline training to online serving and monitoring.
  • Collaborate with teams across the company to define ML roadmap.
  • Incorporate efficiency metrics into our day to day compute and alleviate performance bottlenecks
  • Design and implement infrastructure for orchestrating end to end machine learning lifecycles
  • Deploy scalable systems on major CSPs (Azure, AWS, GCP, etc)
  • Set up processes to monitor and continually improve efficiency, accuracy and performance of models
  • Software development including algorithm implementation, optimization, performance profiling, integration to production systems, testing and documentation

Requirements

  • BS/MS or PhD in CS, Computer Engineering, Networking, ML, AI, Engineering or equivalent
  • Proven experience as a Machine Learning Engineer or similar role
  • Deep understanding of distributed systems for ML applications
  • Demonstrated ability to apply DDP and MDP techniques for ML models
  • Working knowledge of cluster management and using distributed frameworks for messaging, workload orchestration, storage, etc such as Apache Kafka, Slurm, Zookeeper, Redis, Scala, MPI, OpenMP
  • Working knowledge of designing distributed autoscaling systems on at least one cloud service provider (AWS, GCP, Azure,etc)
  • 3+ years of designing and deploying large distributed optimized ML serving infra in a production environment
  • Ability to code and iterate independently on top of data infrastructure tools like Python, Spark, Jupyter notebooks, standard ML libraries, etc.
  • Take pride in taking ownership and driving projects to business impact.
  • Experience with the industry application of optimization and/or reinforcement learning is a big plus.
  • Data analytics and data engineering experience is a plus.Ability to work with technical and non-technical teams.
  • Master's degree or equivalent work experience in Computer Science, Mathematics, Engineering, Operation Research, or a closely related field.

Benefits

Full benefits

  • Health Care Plan (Medical, Dental & Vision)
  • Paid Time Off (Vacation, Sick & Public Holidays)
  • Free Food & Snacks
  • Stock Option Plan

Date Posted

06/26/2023

Views

9

Back to Job Listings Add To Job List Company Profile View Company Reviews
Positive
Subjectivity Score: 0.8

Similar Jobs

Senior Staff Simulation Engineer - Wisk

Views in the last 30 days - 0

Wisk Aero is seeking a Senior Staff Simulation Engineer to join their Flight Physics Vehicle Modeling FPVM team The role involves designing implementi...

View Details

Senior Simulation Software Integration Engineer - Wisk

Views in the last 30 days - 0

Wisk is seeking a Senior Simulation Software Integration Engineer to lead the integration of highfidelity simulation models develop advanced test fram...

View Details

Support Engineer - Pricefx

Views in the last 30 days - 0

Pricefx a leading SaaS Pricing Price Optimization Management provider is seeking a Tier 34 Support Engineer The role involves providing technical sup...

View Details

Avionics Mechanical Engineer (Harness) - Reliable Robotics Corporation

Views in the last 30 days - 0

Reliable Robotics is seeking an Avionics Mechanical Engineer to join their Vehicle Design and Integration team in Mountain View California The role in...

View Details

Sr. Flight Software Engineer (Verification) - Reliable Robotics Corporation

Views in the last 30 days - 0

Reliable Robotics is a team of missiondriven engineers developing safetyenhancing technology for aviation aiming to make air transportation safer more...

View Details

Build / Automation Engineer - Aechelon Technology

Views in the last 30 days - 0

Aechelon Technology Inc is seeking an experienced Build Automation Engineer to join their Software Infrastructure team in South San Francisco The role...

View Details