Software Intern, GPU Monitoring and Diagnostics - Summer 2023

NVIDIA · Redmond

Company

NVIDIA

Location

Redmond

Type

Full Time

Job Description

NVIDIA is looking for outstanding software engineers to help us expand our enterprise GPU management and monitoring tools. In this role you will work closely with the broader NVIDIA team to design and build Linux-based management agents, CLI tools and end-to-end integration solutions that combine GPUs with the rest of the data center software management ecosystem. We are focused on supporting NVIDIA products across HPC, cloud and enterprise on both bare metal and virtualized platforms as the role of GPUs in all of these environments expands rapidly. Your contributions will span many aspects of GPU system integration, including telemetry and metrics, health checks, diagnostics, configuration, accounting and policy. These tools serve both passive background monitoring and active online management, with a core emphasis on operational transparency and seamless integration in customer environments. Your code will support everything from single-node developer systems to large clusters with thousands of nodes.

To be successful you will need to have a strong Linux C/C++ background, familiarity with distributed software development and a proven work ethic. You will be expected to jump in quickly and provide important contributions from day one. This is a dynamic work environment with many exciting opportunities awaiting. NVIDIA GPUs are central to many hot trends in the enterprise, cloud and datacenter. Come join us as we craft the future of accelerated compute and AI.

What you'll be doing:

  • Develop robust, scalable C++ user space data center management system software under Linux
  • Build and maintain user-space libraries, agents, plugins, bindings and CLI tools
  • Enable GPU management integration with the OSS ecosystem, including Kubernetes and Docker
  • Support internal and external users through bug fixes, documentation and feature improvements
  • Maintain high quality products through robust test coverage and smart design

What we need to see:

  • You are pursuing a Bachelor's degree in Computer Science or a related field
  • Strong modern C++ development background
  • User space development and debugging expertise under Linux environments
  • Outstanding written and verbal communication skills
  • Strong motivation and commitment to learn new skills
  • Ability to execute all aspects of the software development lifecycle
  • Ability to manage time in a fast, heavily multitasked environment

Ways to stand out from the crowd:

  • Development experience with Python and Go
  • Experience with containers, common orchestration frameworks and common logging/telemetry backends
  • Experience with APIs and interface design
  • Exposure to GPU programming with CUDA

NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most expert and versatile people in the world working for us. If you're hardworking, creative, and autonomous, we want to hear from you!

The hourly rate for our interns is $19 - $93. Our internship hourly rates are a standard pay determined based on the position and your location, year in school, degree, and experience.

You will also be eligible for Intern benefits.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Date Posted

01/14/2023

