Senior Machine Learning Operations (MLOps) Engineer

Bonfy.AI Mountain View, CA

Company

Bonfy.AI

Location

Mountain View, CA

Type

Full Time

Job Description

Bonfy.AI is building the trust layer for generative AI. Our Adaptive Content Security platform detects and mitigates subtle risks embedded in large language model (LLM) outputs before they reach users. From hallucinations to hidden data leaks, we enable enterprises to deploy GenAI confidently, without compromising truth, privacy, or reputation. We are model-agnostic, outcome-driven, and unapologetically rigorous. Our customers include leading Fortune 500 teams working in high-stakes sectors where trust is not optional. Why This Role Matters We need an MLOps Engineer to optimize our GPU-accelerated ML infrastructure for performance and cost efficiency. Working with our existing Sr. DevOps and Sr. SRE teams, you'll focus on the specialized ML optimization challenges that require deep machine learning expertise. • GPU & Cost Optimization: Design optimal GPU configurations and ML deployment strategies to maximize performance while minimizing cloud costs. • ML Performance Tuning: Optimize model serving, memory management, and inference pipelines for production LLM workloads. You will also work with models and customize prompts, write pre- and post-processing methods to improve accuracy and speed (production coded), and implement new models functionality in the system. • DevOps Collaboration: Work with our Sr. DevOps/SRE teams to implement ML-specific solutions and monitoring What We're Looking For • ML Infrastructure Optimization: 5+ years optimizing production ML systems with focus on GPU utilization and cost management • GPU & LLM Expertise: Deep understanding of GPU architectures, memory management, and LLM inference optimization • Python + DevOps Integration: Expert Python programming with experience working alongside DevOps/SRE teams on ML-specific solutions • Bonus: Experience at GPU-focused ML companies (SambaNova, NVIDIA, etc.) or with high-performance ML serving frameworks Why Join Us • Collaborative Impact: Work with our existing Sr. DevOps and Sr. SRE teams to solve ML-specific challenges that require specialized expertise • Technical Depth: Focus purely on cutting-edge ML optimization problems without getting pulled into general infrastructure work • High Autonomy: Direct collaboration with engineering leadership in a fast-paced, technically rigorous environment • Competitive Package: Strong salary, equity, comprehensive benefits, and flexible hybrid work model Bonfy.AI — Truth. Security. Intelligence.
Apply Now

Date Posted

07/23/2025

Views

0

Back to Job Listings ❤️Add To Job List Company Info View Company Reviews
Neutral
Subjectivity Score: 0

Similar Jobs

Physician Assistant - Family Medicine - VillageMD

Views in the last 30 days - 0

View Details

Applied Scientist II, Prime Video - Personalization and Discovery Science - Amazon.com Services LLC

Views in the last 30 days - 0

Prime members can customize their viewing experience and find their favorite movies series documentaries and live sports including Amazon MGM Studios...

View Details

Software Engineer II - Disney Entertainment and ESPN Product & Technology

Views in the last 30 days - 0

Innovation We develop and implement groundbreaking products and techniques that shape industry norms and solve complex and distinctive technical probl...

View Details

Machine Learning & Data Scientist, OS Power & Performance - Apple

Views in the last 30 days - 0

In this role you will analyze high dimensional data to derive meaningful insights and be responsible for producing metrics models simulations and tool...

View Details

Multimodal Generative Modeling Engineer - Apple

Views in the last 30 days - 0

As a multimodal generative modeling engineer in our team you will be responsible for developing machine learning technologies implementing and optimiz...

View Details

Senior/Lead Backend Engineer - Crossbar

Views in the last 30 days - 0

As a senior technical leader youll partner with SDK infra and product teams to create backend services that scale to millions of users while maintaini...

View Details