Staff Software Engineer - Capacity Engineering
Company
Pinterest
Location
Remote
Type
Full Time
Job Description
Pinterest is seeking a Staff Software Engineer, Capacity Engineering, focused on managing and optimizing its ML infrastructure. The team is responsible for efficiently managing one of the largest-scale cloud-native infrastructures in the world. The role is highly impactful because efficiency is an ongoing strategic priority for Pinterest, and it has direct visibility across Pinterest Engineering and with engineering and company leadership. The team is looking for a candidate with a strong background in ML infrastructure and a focus on efficiency and optimization.
What you’ll do
- Manage the ML hardware capacity that powers the models running at Pinterest
- Improve the efficiency of ML infrastructure at Pinterest
- Build, develop, and mature profiling and optimization capabilities for ML infrastructure at Pinterest scale
- Collaborate with ML Platform, Infrastructure Engineering, and SRE teams in their mission to deliver highly available, resilient, secure, and efficient ML foundations for Pinterest’s tech stack
What we’re looking for:
- Deep understanding of GPU architectures, PyTorch, etc.
- Deep understanding of the supporting parts of the ML software stack, such as scheduling, data, and storage
- Hands-on experience with shared platforms like Kubernetes
- Strong technical and performance engineering skills to collaborate with stakeholders on complex and ambiguous technical challenges
- Experience building and managing highly available distributed applications at scale
- Proficiency in software development languages such as Java, Python, and C++
- Excellent skills in communicating complex technical issues
- Understanding of ML models, kernels, and optimization opportunities
- Hands-on experience with large, cloud-native, multi-tenant platforms at Internet scale
- Experience with AWS or similar cloud environments
- Deep understanding of infrastructure capacity and performance
- Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent experience
In-Office Requirement Statement:
- We let the type of work you do guide the collaboration style. That means we're not always working in an office, but we continue to gather for key moments of collaboration and connection.
- This role will need to be in the office for in-person collaboration 1-2 times per quarter and therefore can be situated anywhere in the country.
Relocation Statement:
- This position is not eligible for relocation assistance. Visit our PinFlex page to learn more about our working model.
Date Posted
11/11/2025