Sr. Engineering Manager - ML Platform

Databricks · Other US Location

Company

Databricks

Location

Other US Location

Type

Full Time

Job Description

At Databricks, we are passionate about enabling data teams to solve the world's toughest problems โ€” from making the next mode of transportation a reality to accelerating the development of medical breakthroughs. We do this by building and running the world's best data and AI infrastructure platform so our customers can use deep data insights to improve their business.ย 

Databricks Model Serving provides customers with a robust, reliable system to serve ML models at high QPS and low latency using GPU acceleration. We leverage our serverless architecture to provide superior performance than competitors for LLMs out of the box. Customers also receive greater agility and operability with the integrated data lakehouse, allowing them to iterate on feature engineering and easily monitor model performance.

We are seeking a dedicated Senior Engineering Manager to lead our initiatives around LLM performance, data plane reliability & scalability, and overall margins. You will additionally be one of the senior leaders of the ML Platform and craft overall strategy to build integrations with model monitoring, feature stores, vector dbs, and more.


Key responsibilities of the position include:

  • Leading a talented engineering team to deliver amazing performance and reliability at low cost
  • Recruit top-tier talent, build functional team structures, and coach the team to be a world class organization
  • Evolve processes to improve operational excellence and deliver on the roadmap
  • In collaboration with product management and IC leaders, create and iterate on a roadmap to align product goals with the Lakehouse strategy
  • Work closely with platform teams to build rock-solid infrastructure that can be leveraged by all serverless products at Databricks
  • Frequent customer interaction for support, sales, and product planning purposes

The impact you will have:

  • Lead development for the first real-time product at Databricks to serve over 1M QPS
  • Drive company wide impact by making Databricks the best place to create and deploy enterprise LLMs
  • Complete the Lakehouse AI story by making it super easy for customers to to iterate on features and debug production issues
  • Grow a world class team of software engineers working on our data plane from 10 to 20 over the next 18 months, hire top-notch talent including up to the Staff+ level
  • Ensure consistent delivery against milestones and strong alignment with the field working "two-in-a-box" with product leadership
  • Evolve organizational structure to align with long term initiatives, build strong "5 ingredient" teams with good comms architecture
  • Manage technical debt, including long term technical architecture decisions and balance product roadmap

Minimum requirements for the position include:

  • 3+ years of technical management experience, including managing other managers and engineers at the Staff+ level
  • 8+ years of experience working on highly-available multi-tenant systems with a focus on reliability and efficiency
  • Ability to attract, hire, and coach engineers who meet the Databricks hiring standards. Can up level the existing team via hiring top-notch senior talent, growing leaders and helping struggling members. Can gain trust of the team and guide their careers
  • Comfort working cross functionality with product management and directly with customers; ability to deeply understand product and customer personas

An ideal candidate will also have:

  • Experience working with techniques like quantization, pruning, interleaving, layer fusion, and writing custom CUDA kernels to improve model performance
  • Experience building products for real-time serving infrastructure for models, containers, functions, or similar
  • Experience building or supporting ML systems
  • Experience operating Kubernetes in production environments
  • Experience working with an ML Framework like PyTorch, TensorFlow, or similar

Pay Range Transparency

Databricks is committed to fair and equitable compensation practices. The pay range(s) for this role is listed below and represents base salary range for non-commissionable roles or on-target earnings for commissionable roles.ย  Actual compensation packages are based on several factors that are unique to each candidate, including but not limited to job-related skills, depth of experience, relevant certifications and training, and specific work location. Based on the factors above, Databricks utilizes the full width of the range. The total compensation package for this position may also include eligibility for annual performance bonus, equity, and the benefits listed above. For more information regarding which range your location is in visit our page here.


Local Pay Range
$222,000โ€”$300,000 USD

About Databricks

Databricks is the data and AI company. More than 9,000 organizations worldwide โ€” including Comcast, Condรฉ Nast, and over 50% of the Fortune 500 โ€” rely on the Databricks Lakehouse Platform to unify their data, analytics and AI. Databricks is headquartered in San Francisco, with offices around the globe. Founded by the original creators of Apache Sparkโ„ข, Delta Lake and MLflow, Databricks is on a mission to help data teams solve the worldโ€™s toughest problems. To learn more, follow Databricks on Twitter, LinkedIn and Facebook.


Our Commitment to Diversity and Inclusion

At Databricks, we are committed to fostering a diverse and inclusive culture where everyone can excel. We take great care to ensure that our hiring practices are inclusive and meet equal employment opportunity standards. Individuals looking for employment at Databricks are considered without regard to age, color, disability, ethnicity, family or marital status, gender identity or expression, language, national origin, physical and mental ability, political affiliation, race, religion, sexual orientation, socio-economic status, veteran status, and other protected characteristics.


Compliance

If access to export-controlled technology or source code is required for performance of job duties, it is within Employer's discretion whether to apply for a U.S. government license for such positions, and Employer may decline to proceed with an applicant on this basis alone.

Apply Now

Date Posted

09/02/2023

Views

4

Back to Job Listings Add To Job List Company Profile View Company Reviews
Positive
Subjectivity Score: 0.8

Similar Jobs

Software Engineering Manager - Cargill

Views in the last 30 days - 0

The Software Engineering Manager job involves setting goals for a team responsible for software project development and delivery ensuring quality stan...

View Details

Software Architecture Engineering and Cloud Computing Engineer - The Aerospace Corporation

Views in the last 30 days - 0

The Aerospace Corporation is seeking a Senior Project Engineer with expertise in software architecture engineering and cloud computing The role involv...

View Details

Senior Product Analyst - FinCrime Platform - WISE

Views in the last 30 days - 0

Wise is seeking a Senior Product Analyst for its FinCrime Platform The role involves driving analytics efforts in the Financial Crime Platform product...

View Details

Sales Development Representative - UK (Remote) - Dscout

Views in the last 30 days - 0

Dscout is a company that specializes in experience research solutions helping innovative companies like Salesforce Sonos Groupon and Best Buy to build...

View Details

Intern People Experience - Personio

Views in the last 30 days - 0

Personio is an HR platform that simplifies complex tasks for small and mediumsized organizations With a team of over 1800 employees across Europe and ...

View Details

Senior Finance Business Partner (d/f/m) - Personio

Views in the last 30 days - 0

Personio an intelligent HR platform is seeking a Senior Manager for FPA to lead financial planning and analysis for key departments The ideal candidat...

View Details