Specialist, HPC Systems Research & Development

KLA · Other US Location

Company

KLA

Location

Other US Location

Type

Full Time

Job Description

Company Overview

KLA is a global leader in diversified electronics for the semiconductor manufacturing ecosystem. Virtually every electronic device in the world is produced using our technologies. No laptop, smartphone, wearable device, voice-controlled gadget, flexible screen, VR device or smart car would have made it into your hands without us. KLA invents systems and solutions for the manufacturing of wafers and reticles, integrated circuits, packaging, printed circuit boards and flat panel displays. The innovative ideas and devices that are advancing humanity all begin with inspiration, research and development. KLA places an above-average focus on innovation, and we invest 15% of sales back into R&D. Our expert teams of physicists, engineers, data scientists and problem-solvers work together with the world’s leading technology providers to accelerate the delivery of tomorrow’s electronic devices. Life here is exciting and our teams thrive on tackling really hard problems. There is never a dull moment with us.

Group/Division

The mission of KLA Advanced Computing Labs (ACL) in India is to deliver advanced parallel computing research and software architectures for AI + HPC + Cloud solutions that accelerate the performance of KLA’s products. ACL explores high-risk approaches, pioneering technologies, and novel methods to accelerate KLA’s algorithms and contribute to KLA’s HPC technology roadmap. Based at the IIT Madras Research Park in Chennai, India, we engage leading thinkers in academia, industry and KLA’s business units to create innovative parallel computing methods that enable KLA’s business growth.

Job Description

KLA’s AI Advanced Computing Labs is looking for an extraordinary HPC System R&D Engineer to join its team and develop system-level HPC technologies that will form the foundation of next-generation clusters used in KLA tools that leverage AI to push the boundaries of process control for semiconductor manufacturing. The technologies will be developed and demonstrated on on-prem clusters that serve as testbeds for next-generation KLA tools.

 

Your Day-to-day Roles

  • Expose limitations in existing solutions, built on clusters of CPUs & GPUs, for deploying AI-based solutions at scale on on-prem & cloud infrastructures.
  • Develop distributed frameworks and system-level solutions that enable scaling out image processing & AI loads from a single GPU to multi-node clusters with multiple GPUs.
  • Install, benchmark, and evaluate pre-release hardware for early-stage evaluation and prototyping by identifying (or developing) relevant workloads.

Minimum Qualifications

  • Master's / PhD in Computer Science or related fields; bachelor's degree holders with relevant experience and an extraordinary track record will also be considered.
  • Deep understanding of operating systems, computer networks, and high-performance applications.
  • Good mental model of the architecture of a modern distributed system comprising CPUs, GPUs, and accelerators.
  • Experience with deployments of deep-learning frameworks based on TensorFlow and PyTorch on large-scale on-prem or cloud infrastructures.
  • Strong background in modern and advanced C++ concepts.
  • Strong scripting skills in Bash, Python, or similar.
  • Good communication skills.

Things to Make us go Wow!

  • Experience in heterogeneous programming languages such as CUDA and Triton.
  • Experience with model development on DL frameworks such as TensorFlow and PyTorch.
  • Experience with building open-source operating systems and software stacks on pre-release hardware.
  • Solid understanding of container infrastructure such as Docker or Singularity, and Kubernetes.
  • Active participation in C++ standards bodies or similar.

We offer a competitive, family-friendly total rewards package. We design our programs to reflect our commitment to an inclusive environment, while ensuring we provide benefits that meet the diverse needs of our employees.

KLA is proud to be an equal opportunity employer.

Date Posted

09/25/2024
