CUDA Kernel Optimization Specialist - AI Trainer
Job Description
Role Overview
Analyze and optimize GPU kernels for performance, efficiency, and hardware utilization. Use profiler metrics to guide kernel improvements. Review GPU kernel implementations to identify bottlenecks without needing extensive algorithmic background.
What You Will Do
Write, modify, and reason about C++17, Python, and GPU programming code. Apply CUDA, HIP, and shader programming expertise to improve performance outcomes. Document optimization decisions clearly.
Why It Might Be a Fit
Must have at least 1 year of professional or graduate-level research experience with GPUs. Strong understanding of GPU profiler performance metrics for kernel optimization. Ability to optimize GPU kernels without deep prior context on every algorithm.
Requirements
- Available to work at least 20 hrs/wk.
- Fluent in core C++ features through C++17.
- Working knowledge of Python and Git.
- Fluent in at least one GPU programming model like CUDA, HIP, Slang, HLSL, or GLSL.
- At least 1 year of professional or graduate-level research experience with GPUs.
- Strong understanding of GPU profiler performance metrics for kernel optimization.
- Ability to optimize GPU kernels without deep prior context on every algorithm.
Originally posted on Himalayas
Explore More
Date Posted
06/06/2026
Views
0
Similar Jobs
Senior Marketing Specialist (Growth & Conversion) (100% Remote – Chicago Area Pr - WIN Home Inspection
Views in the last 30 days - 0
View DetailsClient Experience Specialist (100%Remote – Chicago Area Preferred) - WIN Home Inspection
Views in the last 30 days - 0
View DetailsMicrosoft Dynamics 365 Presales Architect (Global Business Applications) - Quisitive
Views in the last 30 days - 0
View DetailsProfessional Services Business Development Manager (Sales) - Arista Networks
Views in the last 30 days - 0
View DetailsSenior Account Executive, Affiliate Brand Partnerships - StackCommerce
Views in the last 30 days - 0
View Details