Software Engineer, Systems ML - HPC Specialist

Meta Remote

Company

Meta

Location

Remote

Type

Full Time

Job Description

Meta is seeking an AI Software Engineer to join our Research & Development teams. The ideal candidate will have industry experience working on AI infrastructure topics. The position involves applying these skills to solve some of the most crucial and exciting problems that exist on the web.

Some aspects of this role as an HPC specialist may include authoring components such as cuBLAS, cuDNN, AITemplate, and FlashAttention, and developing runtimes such as an LLM disaggregated runtime. HPC specialists spend time optimizing programs to reduce accelerator idle time. They also develop tools to debug (such as cuda-gdb) and profile workloads that use accelerated computing hardware (such as the PEs and SFUs in MTIA or the Transformer Engine in H100). They are systems experts who can design, debug, and accelerate AI workloads from single-node scale-up to multi-node scale-out distributed systems. They can also influence the next generation of silicon architectures (such as the Tensor Core in V100 or the Transformer Engine in H100) based on evolving AI workload needs.

We are hiring in multiple locations.


Software Engineer, Systems ML - HPC Specialist Responsibilities:
  • Apply relevant AI and machine learning techniques to build and optimize our intelligent systems that improve Meta's products and experiences
  • Develop custom/novel architectures, define use cases, and develop methodology & benchmarks to evaluate different approaches
  • Apply in-depth knowledge of how the machine learning system interacts with the other systems around it
  • Assist in goal setting related to project impact, AI system design, and ML excellence
Minimum Qualifications:
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience.
  • 2+ years of experience in HPC and parallel computing.
  • Proficiency in GPU programming using CUDA and familiarity with CUDA libraries (cuBLAS, cuDNN, etc.).
  • Proven track record of leading successful HPC projects.
  • Proven technical expertise in HPC architectures and technologies.
Preferred Qualifications:
  • PhD in Computer Science, Computer Engineering, or relevant technical field.
  • Experience developing AI algorithms or AI-System infrastructure in C/C++ or Python.
  • Experience developing AI compilers (such as TorchInductor in PyTorch 2.0).
About Meta:

Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today, beyond the constraints of screens, the limits of distance, and even the rules of physics.

Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. Meta participates in the E-Verify program in certain locations, as required by law. Please note that Meta may leverage artificial intelligence and machine learning technologies in connection with applications for employment.

Meta is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at [email protected].

$70.67/hour to $208,000/year + bonus + equity + benefits

Individual compensation is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base hourly rate, monthly rate, or annual salary only, and do not include bonus, equity or sales incentives, if applicable. In addition to base compensation, Meta offers benefits. Learn more about benefits at Meta.

Date Posted

12/24/2024
