Jobs at CoreWeave
288 open positions
Senior Engineer: Infrastructure Automation (ML Systems)
Company: CoreWeave
Location: Other US Location
Posted Sep 12, 2023
CoreWeave is seeking a Senior Infrastructure Automation Engineer to join the ML Interfaces Team. The role involves building scalable and fault-tolerant interfaces for consuming GPU resources, creating test plans, deployment automation, dashboards, alerts, and insights. The ideal candidate should have 4+ years of experience in software engineering, specializing in distributed systems, and be comfortable with Go and Linux. Knowledge of Kubernetes, Slurm, KNative, and Istio is beneficial. CoreWeave offers competitive compensation, comprehensive benefits, and a dynamic work environment.
Network Engineer, Datacenter
Company: CoreWeave
Location: Remote
Posted Sep 06, 2023
CoreWeave is a specialized cloud provider offering high-performance GPU compute resources for compute-intensive use cases like VFX, rendering, machine learning, AI, batch processing, and Pixel Streaming. They aim to deliver world-class network infrastructure with top-notch automation and modern architectural concepts. The company is seeking a Network Engineer with 3+ years of experience in network engineering, proficiency in routing protocols, Python/Shell scripting, and automation frameworks. CoreWeave offers competitive compensation, comprehensive benefits, and a dynamic work environment focused on innovation and collaboration.
Kernel Engineer
Company: CoreWeave
Location: Brooklyn
Posted Sep 15, 2023
We will consider all qualified applicants without regard to race, color, nationality, gender, gender identity or expression, sexual orientation, religion, disability or age. You will be surrounded by some of the best talent in the industry, who will want to learn from you, too. Come join us! BenefitsWe offer a competitive salary and benefits, including:Medical, dental and vision insurance - 100% paid for the employeeLife Insurance Short and long-term disability insurance Flexible Spending AccountFlexible, full-service childcare support with Kinside401(k) with a generous employer matchFlexible PTOCatered lunch each day in our officesWeekly massages in NJ officeA casual work environmentWork culture focused on innovative disruptionCalifornia Consumer Privacy Act - California applicants onlyCoreWeave is an equal opportunity employer, committed to our diversity and inclusiveness. We’re not afraid of a little chaos, and we’re constantly learning. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. Why CoreWeave?At CoreWeave, we work hard, have fun, and move fast! We’re in an exciting stage of hyper-growth that you will not want to miss out on. As we get set for take off, the growth opportunities within the organization are constantly expanding. You will collaborate closely with cross-functional teams, up stack engineering teams, and stakeholders to ensure the successful delivery of highly performant and reliable software solutions.Kernel Hardware - Acceleration - Virtualization - Operating Systems - Containerization - KubeletOur Team’s Stack:Linux Kernel (custom build, currently tracking Ubuntu HWE)Intel/AMD CPUs, Nvidia GPUs, DPUs, Infiniband and Ethernet NICsKubeVirt, QEMU, SR-IOV, vfio-pciUbuntu 22.04Containerd, KubeletResponsibilities:Develop and maintain tooling to build custom Linux kernels and stateless OS imagesAutomate packaging of critical components (drivers, microcode, components with out-of-tree patches, etc)Serve as a senior point of contact for hardware issue escalation and troubleshootingCollaborate with cross-functional teams to define Linux and OS requirements, specifications, and system architectureAnalyze and optimize the performance of bare-metal and virtualized systems, identify bottlenecks, and propose improvements for enhanced efficiencyRequirements:Must have at least 5 years of professional experience maintaining large fleets of Linux serversDeep professional experience with troubleshooting and debugging hardware, OS, and kernel issuesHistory of improving system efficiency within different subsystems (network, storage, security)Strong familiarity with sysctls, cgroups, iommu, init systems, seccomp/apparmorAbility to effectively prioritize and communicate proposed features and fixesStrong passion for automation, with a commitment to automating processes comprehensivelyExcellent documentation skills and attention to detailStrong analytical and problem-solving abilitiesNice-to-haves:Experience with kexec, kpatch, kdumpExperience building CI/CD pipelines (GitHub or GitLab)Opinions about software version control and team collaboration Experience writing software testsOur compensation reflects the cost of labor across several US geographic markets. The team’s primary responsibilities include maintaining a custom Linux kernel, various OS images (Ubuntu-based), the virtualization stack (kubevirt/qemu/vfio), and the container/pod runtime stack (containerd/nydus/kubelet). Our team cares deeply about how we build our product and how we work together, which is represented through our core values: Be Curious at your CoreAct like an OwnerEmpower EmployeesDeliver Best In-Class Client Experience Achieve More TogetherWe support and encourage an entrepreneurial outlook and independent thinking. In this role, you will play a crucial part in the design, development, and optimization of our bare-metal systems from POST through joining a Kubernetes cluster.
HPC Operations Engineer (Non-Traditional Schedule)
Company: CoreWeave
Location: Other US Location
Posted Sep 13, 2023
CoreWeave is a specialized cloud provider offering high-performance GPU compute resources for compute-intensive use cases like VFX, rendering, machine learning, AI, batch processing, and Pixel Streaming. The company is seeking experienced problem solvers to join their HPC Operations team, responsible for provisioning, managing, and maintaining their expanding fleet of server nodes. Key responsibilities include installing, configuring, and maintaining large-scale supercomputing clusters, troubleshooting hardware and software issues, monitoring system performance, and collaborating to improve team processes and efficiency. CoreWeave offers competitive compensation, benefits, and a dynamic work environment focused on innovation and collaboration.
Engineering Manager: ML Interfaces
Company: CoreWeave
Location: Remote
Posted Sep 07, 2023
CoreWeave is a cloud provider that delivers massive scale GPU compute resources. They are seeking an engineering leader to build and guide a new team of engineers. The company values diversity, inclusiveness, and innovation, and offers competitive benefits and a casual work environment.
Senior Network Developer
Company: CoreWeave
Location: Remote
Posted Sep 19, 2023
We will consider all qualified applicants without regard to race, color, nationality, gender, gender identity or expression, sexual orientation, religion, disability or age. We’re not afraid of a little chaos, and we’re constantly learning. You will be surrounded by some of the best talent in the industry, who will want to learn from you, too. Come join us! BenefitsWe offer a competitive salary and benefits, including:Medical, dental and vision insurance - 100% paid for the employeeLife Insurance Short and long-term disability insurance Flexible Spending AccountFlexible, full-service childcare support with Kinside401(k) with a generous employer matchFlexible PTOCatered lunch each day in our officesWeekly massages in NJ officeA casual work environmentWork culture focused on innovative disruptionCalifornia Consumer Privacy Act - California applicants onlyCoreWeave is an equal opportunity employer, committed to our diversity and inclusiveness. As we get set for take off, the growth opportunities within the organization are constantly expanding. K8’s is life. If you go to bed thinking about how to do this, this is the place for you.-10 Points if you commit directly to mainA great attitude, without an ego, and a willingness to help those more junior, and learn from those more senior.Nice-to-haves:5+ years experience directly or indirectly helping deploy & manage monitoring of large networksKnowledge of the 11 herbs and spices in the KFC recipeWhy CoreWeave?At CoreWeave, we work hard, have fun, and move fast! We’re in an exciting stage of hyper-growth that you will not want to miss out on. Your day to day will consist of writing code, creating dashboards, writing alerts, and working closely with our Infra Network Engineering teams to build, operate, and monitor CoreWeave’s network. gNMI or bust!Prometheus, Grafana, Alert Manager, gNMI, gRPC/Rest APIs, and SNMPDeep understanding and working knowledge of Linux+10 points if you have so much home automation that you can't even remember what a light switch feels likeFor Loop Engineering for the win! Our team cares deeply about how we build our product and how we work together, which is represented through our core values: Be Curious at your CoreAct like an OwnerEmpower EmployeesDeliver Best In-Class Client Experience Achieve More TogetherWe support and encourage an entrepreneurial outlook and independent thinking. Your goal is to make our network so highly automated and intelligent, that you forget it’s even there.Qualifications3+ years experience in the following areas:Exposure to multiple networking vendorsMellanox, Nokia, Juniper, Arista, CiscoNVIDIA/Mellanox/Nokia is our bread and butter, this is your target for monitoringExperience supporting large infrastructure projectsA love for the following tools and protocols:SNMP is last for a reason.
Inventory Control Specialist - LGA
Company: CoreWeave
Location: New York City, NY
Posted Sep 16, 2023
We will consider all qualified applicants without regard to race, color, nationality, gender, gender identity or expression, sexual orientation, religion, disability or age. You will be surrounded by some of the best talent in the industry, who will want to learn from you, too. Come join us! BenefitsWe offer a competitive salary and benefits, including:Medical, dental and vision insurance - 100% paid for the employeeLife Insurance Short and long-term disability insurance Flexible Spending AccountFlexible, full-service childcare support with Kinside401(k) with a generous employer matchFlexible PTOCatered lunch each day in our officesWeekly massages in NJ officeA casual work environmentWork culture focused on innovative disruptionCalifornia Consumer Privacy Act - California applicants onlyCoreWeave is an equal opportunity employer, committed to our diversity and inclusiveness. We’re not afraid of a little chaos, and we’re constantly learning. As we get set for take off, the growth opportunities within the organization are constantly expanding. You will be responsible for recording and tracking onsite assets, managing logistics, conducting audits, and ensuring that our equipment and resources are effectively utilized within your region. Our team cares deeply about how we build our product and how we work together, which is represented through our core values: Be Curious at your CoreAct like an OwnerEmpower EmployeesDeliver Best In-Class Client Experience Achieve More TogetherWe support and encourage an entrepreneurial outlook and independent thinking. This role requires a strong attention to detail, excellent communication skills, and the ability to work collaboratively with cross-functional teams.Responsibilities:Asset Tracking: Maintain an accurate inventory of all hardware and other IT assets within the data center region, including servers, networking equipment, and other hardware and materials.Logistics Management: Coordinate the shipping and receiving of IT materials and ensure their safe storage and distribution within the data center and to other facilities.Audits: Conduct ongoing audits of the asset inventory to verify accuracy and completeness, and make necessary updates to the inventory records.Resource Allocation: Collaborate with the operations team to allocate resources efficiently, ensuring that hardware and materials are available when needed and optimizing utilization.Documentation: Keep detailed records of inventory, shipments, and audits, and provide regular reports to management.Technology Skills: Utilize inventory management software and other tools to maintain accurate records.Communication: Maintain open and effective communication with various teams, including Operations, IT, Procurement, and Finance to ensure smooth workflow.Problem Solving: Identify and resolve discrepancies in inventory records and take proactive measures to prevent inventory-related issues.Travel: Be willing to travel (roughly 25%) within the designated region as needed to support inventory management and audits at various data center locations.Qualifications:Proven experience in inventory management or a related field.Strong proficiency in Microsoft Excel.Familiarity with asset management softwareExcellent organizational and problem-solving skills.Detail-oriented with a high level of accuracy.A curious nature to identify and solve problemsEffective communication and teamwork skills.Ability to adapt to a dynamic and fast-paced startup environment.Comfortable working in a data center environment, and ability to move and lift heavy objectsCapable of flexing and pivoting as priorities shiftA passion for technology and a willingness to learn about the latest advancements in cloud compute services.If you are a motivated individual who thrives in a fast-paced environment and is excited about the opportunity to contribute to the success of a growing startup we encourage you to apply!Why CoreWeave?At CoreWeave, we work hard, have fun, and move fast! We’re in an exciting stage of hyper-growth that you will not want to miss out on. If you are passionate about technology, logistics, and ensuring efficient asset management, we invite you to be a part of our exciting journey.As an Inventory Control Specialist at CoreWeave you will be a critical contributor to the efficient operation of our data centers. We foster an environment that encourages collaboration and provides the opportunity to develop innovative solutions to complex problems.
Network Engineer, Datacenter
Company: CoreWeave
Location: Remote
Posted Sep 06, 2023
CoreWeave is a specialized cloud provider offering high-performance GPU compute resources for compute-intensive use cases like VFX, rendering, machine learning, AI, batch processing, and Pixel Streaming. They are up to 35 times faster and 80% less expensive than large public clouds. The company is seeking a Network Engineer with 3+ years of experience in network infrastructure, automation, and routing protocols. The role involves deploying datacenter fabrics, operationalizing deployed datacenters, and leveraging automation frameworks. CoreWeave offers competitive compensation, benefits, and a dynamic work environment focused on innovation and collaboration.
Senior Network Engineer, Datacenter
Company: CoreWeave
Location: Remote
Posted Sep 06, 2023
CoreWeave is a specialized cloud provider offering high-performance GPU compute resources for compute-intensive use cases like VFX, rendering, machine learning, AI, batch processing, and Pixel Streaming. They aim to deliver world-class network infrastructure with top-notch automation and modern architectural concepts. The company is seeking a Network Engineer with 7+ years of experience in networking vendors, datacenter routing protocols, Python/Shell scripting, and automation frameworks. CoreWeave offers competitive compensation, benefits, and a dynamic work environment focused on innovation and collaboration.
HPC Operations Engineer
Company: CoreWeave
Location: Other US Location
Posted Sep 15, 2023
We will consider all qualified applicants without regard to race, color, nationality, gender, gender identity or expression, sexual orientation, religion, disability or age. We’re not afraid of a little chaos, and we’re constantly learning. Come join us! BenefitsWe offer a competitive salary and benefits, including:Medical, dental and vision insurance - 100% paid for the employeeLife Insurance Short and long-term disability insurance Flexible Spending AccountFlexible, full-service childcare support with Kinside401(k) with a generous employer matchFlexible PTOCatered lunch each day in our officesWeekly massages in NJ officeA casual work environmentWork culture focused on innovative disruptionCalifornia Consumer Privacy Act - California applicants onlyCoreWeave is an equal opportunity employer, committed to our diversity and inclusiveness. You will be surrounded by some of the best talent in the industry, who will want to learn from you, too. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience.Why CoreWeave?At CoreWeave, we work hard, have fun, and move fast! We’re in an exciting stage of hyper-growth that you will not want to miss out on. As we get set for take off, the growth opportunities within the organization are constantly expanding. Our team cares deeply about how we build our product and how we work together, which is represented through our core values: Be Curious at your CoreAct like an OwnerEmpower EmployeesDeliver Best In-Class Client Experience Achieve More TogetherWe support and encourage an entrepreneurial outlook and independent thinking. This individual will join a team of committed engineers working to deploy nodes as fast as they can be racked and turned on. Key Responsibilities:Install, configure, and maintain large-scale high-performance supercomputing clusters running state-of-the-art GPUsTroubleshoot hardware and software issues; escalate and coordinate as needed with data center, network and platform teams to drive resolutionMonitor and analyze system performance and take appropriate remediation actions for cloud healthApproach your work with flexibility and optimism anticipating shifting business and technical prioritiesCreate and maintain documentation of team processes, knowledge and best practices for system managementThink critically about your day-to-day work and work collaboratively to improve team processes and efficiencySuccessful candidates typically share the following skills and experience:2 or more years of experience troubleshooting or administering data center or on-prem infrastructure (servers, storage, network or a mix)Strong understanding of Linux system administration and networking conceptsAbility to troubleshoot hardware and software issues and perform system maintenance tasks consistently and reliablyBachelor’s degree in a related field or equivalent experience Ideal candidates may also have experience in one or more of these:Software development or scripting languages (bash, python, powershell, etc)Grafana, prometheus, promsql queries or similar observability platformsData center environments including server racks, HVAC systems, fiber traysKubernetes administrationOur compensation reflects the cost of labor across several US geographic markets. Playing a central role in CoreWeave’s growth strategy, this team is on the front line for configuration, updates and remote troubleshooting of our highest tier of supercomputing clusters and their networking, delivery platforms and tools dependencies. However, we are willing to look at remote candidates. About the role:The High Performance Computing Operations team is responsible for the day-to-day provisioning, management and uptime of CoreWeave’s ever-expanding fleet of server nodes.
HR Business Partner
Company: CoreWeave
Location: Other US Location
Posted Sep 06, 2023
CoreWeave is a specialized cloud provider seeking an experienced HR Business Partner to support its growth. The role involves managing HR programs, driving performance management, career development, employee recognition, engagement, and issue resolution. The ideal candidate should have a Bachelor's degree in HR or related field, proven experience as an HR Business Partner, strong knowledge of HR best practices, excellent interpersonal skills, and HR certification is a plus. CoreWeave offers a competitive salary, comprehensive benefits, and a dynamic work environment focused on innovation and collaboration.
Senior Director of Solutions Engineering
Company: CoreWeave
Location: Other US Location
Posted Sep 16, 2023
We will consider all qualified applicants without regard to race, color, nationality, gender, gender identity or expression, sexual orientation, religion, disability or age. You will be surrounded by some of the best talent in the industry, who will want to learn from you, too. Come join us! BenefitsWe offer a competitive salary and benefits, including:Medical, dental and vision insurance - 100% paid for the employeeLife Insurance Short and long-term disability insurance Flexible Spending AccountFlexible, full-service childcare support with Kinside401(k) with a generous employer matchFlexible PTOCatered lunch each day in our officesWeekly massages in NJ officeA casual work environmentWork culture focused on innovative disruptionCalifornia Consumer Privacy Act - California applicants onlyCoreWeave is an equal opportunity employer, committed to our diversity and inclusiveness. We’re not afraid of a little chaos, and we’re constantly learning. As we get set for take off, the growth opportunities within the organization are constantly expanding. You will be responsible for building, leading, and scaling a dynamic team of solutions engineers, working closely with customers, and driving innovation to deliver world-class GPU cloud solutions. Your extensive experience in creating and scaling similar organizations, particularly at startups, will be invaluable in this role.Key Responsibilities:Build and lead a high-performing Solutions Engineering organization, fostering a culture of innovation, customer-centricity, and collaboration.Develop and execute a strategic plan for the Solutions Engineering organization that aligns with the company's overall vision and goals.Engage directly with customers to understand their unique GPU cloud needs and challenges, and ensure that our solutions are tailored to meet those needs effectively.Collaborate with product management and engineering teams to provide input on product development based on customer feedback and market trends.Recruit, train, and mentor a team of solutions engineers, ensuring they have the skills and resources needed to succeed.Work closely with the solutions engineering team to design and architect GPU cloud solutions that address customer requirements and deliver exceptional performance.Maintain a deep understanding of GPU technologies, cloud computing, and industry trends, serving as a subject matter expert both internally and externally.Collaborate cross-functionally with sales, marketing, and customer support teams to drive the adoption of GPU cloud solutions and ensure a seamless customer experience.Define and track key performance metrics to measure the success and impact of the Solutions Engineering organization.Qualifications:Bachelor's degree in computer science, engineering, or a related field Proven experience (10+ years) in building and leading Solutions Engineering teams in the cloud computing industry.Deep knowledge of GPU technologies, cloud computing platforms and AI/ML frameworks.Startup experience is a significant plus, showcasing adaptability, resourcefulness, and the ability to thrive in a fast-paced environment.Exceptional leadership, communication, and interpersonal skills.Strong problem-solving abilities and a customer-centric mindset.Strategic thinking and the ability to drive results in a rapidly evolving market.If you are a visionary leader with a track record of building and scaling Solutions Engineering organizations, and you're excited about the prospect of leading this effort again, we encourage you to apply.Why CoreWeave?At CoreWeave, we work hard, have fun, and move fast! We’re in an exciting stage of hyper-growth that you will not want to miss out on. Our team cares deeply about how we build our product and how we work together, which is represented through our core values: Be Curious at your CoreAct like an OwnerEmpower EmployeesDeliver Best In-Class Client Experience Achieve More TogetherWe support and encourage an entrepreneurial outlook and independent thinking. We foster an environment that encourages collaboration and provides the opportunity to develop innovative solutions to complex problems. CoreWeave builds cloud solutions for compute intensive use cases — VFX and rendering, machine learning and AI, batch processing, and Pixel Streaming — that are up to 35 times faster and 80% less expensive than the large, generalized public clouds.