Senior Engineer: Infrastructure Automation (ML Systems)

CoreWeave · Other US Location

Company

CoreWeave

Location

Other US Location

Type

Full Time

Job Description

CoreWeave is a specialized cloud provider, delivering GPU compute resources at massive scale on top of the industry’s fastest and most flexible infrastructure. CoreWeave builds cloud solutions for compute-intensive use cases (VFX and rendering, machine learning and AI, batch processing, and Pixel Streaming) that are up to 35 times faster and 80% less expensive than the large, generalized public clouds. Learn more at www.coreweave.com.

About the role:

The ML Interfaces Team plays a key role in CoreWeave’s customer journey as the team responsible for supporting and innovating the interfaces our customers use to schedule and drive their machine learning workloads. Initially supporting our Slurm and Knative interfaces, this team has a mandate to keep its finger on the pulse of the machine learning community and provide commoditized solutions targeted to their needs that reduce friction and increase the efficiency and reliability of consuming CoreWeave’s world-class GPU cloud.

We are seeking a Senior Infrastructure Automation Engineer to join the ML Interfaces Team and help us build the interfaces of consumption that CoreWeave’s customers need in order to be successful. This individual will join a team of 4-6 mixed-skill engineers and have the opportunity to work on the full gamut of rewarding challenges that come with the business of building a cloud in a communicative, supportive, and high-performing environment. As a member of the ML Interfaces Team, you would have the opportunity to:

  • Identify and implement scalable and fault-tolerant interfaces for consuming GPU resources that are responsive to the needs and practices of the ML community.
  • Create test plans, deployment automation, dashboards, alerts, and insights into our product’s operations as well as participate in the ML Interfaces on-call rotation.
  • Grow, change, invest in your teammates, be invested-in, share your ideas, listen to others, be curious, have fun, and, above all, be yourself.

Wondering if you’re a good fit? We believe in investing in our people, and value candidates who can bring their own diversified experiences to our teams – even if you aren't a 100% skill or experience match. Here are some qualities we’ve found compatible with our team. If a portion of this resonates with you, we’d love to talk. 

  • You have four or more years of experience in the software engineering industry with a specialization in developing and troubleshooting distributed systems in production and at scale.
  • You have a drive to learn and grow in a rapidly evolving technology space and have interest or experience in some of the current core technologies supported by the team such as Slurm, Knative, and/or Istio.
  • You are comfortable with the idea of using Go as your primary programming language and are capable of navigating a Linux operating environment.
  • You have some experience using Kubernetes with a conceptual understanding of its major components and ingress/service meshes.
  • You can transform problems into elastic architectures, decompose them into achievable tasks, and socialize both to your teammates.
  • You’re interested in reliability engineering concepts such as the different types of testing, progressive deployments, error budgets, the role of observability, and fault-tolerant design.

Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $165,000/year in our lowest geographic market up to $220,000/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience.  

Why CoreWeave?

At CoreWeave, we work hard, have fun, and move fast!  We’re in an exciting stage of hyper-growth that you will not want to miss out on. We’re not afraid of a little chaos, and we’re constantly learning. Our team cares deeply about how we build our product and how we work together, which is represented through our core values: 

  • Be Curious at your Core
  • Act like an Owner
  • Empower Employees
  • Deliver Best In-Class Client Experience 
  • Achieve More Together

We support and encourage an entrepreneurial outlook and independent thinking. We foster an environment that encourages collaboration and provides the opportunity to develop innovative solutions to complex problems. As we get set for takeoff, the growth opportunities within the organization are constantly expanding. You will be surrounded by some of the best talent in the industry, who will want to learn from you, too. Come join us!

Benefits

We offer a competitive salary and benefits, including:

  • Medical, dental and vision insurance - 100% paid for the employee
  • Life Insurance 
  • Short and long-term disability insurance 
  • Flexible Spending Account
  • Flexible, full-service childcare support with Kinside
  • 401(k) with a generous employer match
  • Flexible PTO
  • Catered lunch each day in our offices
  • Weekly massages in NJ office
  • A casual work environment
  • Work culture focused on innovative disruption

California Consumer Privacy Act - California applicants only

CoreWeave is an equal opportunity employer, committed to diversity and inclusiveness. We will consider all qualified applicants without regard to race, color, nationality, gender, gender identity or expression, sexual orientation, religion, disability, or age.


Date Posted

09/12/2023
