System Engineer

Company

Supermicro

Location

South Bay

Type

Full Time

Job Description

Job Req ID: 23069

About Supermicro:

Supermicro® is a Top Tier provider of advanced server, storage, and networking solutions for Data Center, Cloud Computing, Enterprise IT, Hadoop/Big Data, Hyperscale, HPC, and IoT/Embedded customers worldwide. We are the #5 fastest growing company among the Silicon Valley Top 50 technology firms. Our unprecedented global expansion has provided us with the opportunity to offer a large number of new positions to the technology community. We seek talented, passionate, and committed engineers, technologists, and business leaders to join us.

Job Summary:

As a System Engineer, you'll be the go-to person for rolling out and maintaining business-critical applications and services for Supermicro. You are also responsible for resolving escalated service issues, coaching other engineers toward resolutions, and engineering and implementing complex projects. You will work independently, show leadership in driving technical development, and have excellent communication skills.

Essential Duties and Responsibilities:

Includes the following essential duties and responsibilities (other duties may also be assigned):
• Perform cluster/rack-level testing and software deployment for local/onsite customers
• Responsible for Cloud, Storage, and AI/Deep Learning benchmarks and testing
• Responsible for proof-of-concept (PoC) setup and network troubleshooting
• Test AI applications using ML/DL benchmarks and workloads such as MLPerf, LLM, and RAG
• Conduct functionality, compatibility, performance, stress, and reliability testing
• Report hardware and software quality issues and work with other teams to solve them
• Document and analyze test data and test logs, and write test reports
• Contribute to the development of test utilities and test script automation
• Support internal and external quality issues and drive issue resolution

Qualifications:
• BS/MS in Electrical Engineering, Computer Engineering, or Computer Science
• 5+ years of work-related experience in Deep Learning and Machine Learning
• 5+ years of Linux/networking debugging/testing or relevant experience preferred
• Experience with leading AI/ML frameworks such as PyTorch, TensorFlow, ONNX, etc.
• Experience with DevOps or in cloud environments, including but not limited to Docker/containers and Kubernetes
• Hands-on experience with workload managers/schedulers (Slurm) for rack/cluster
• Familiar with MLPerf Training/Inference benchmarks, LLM, HPL-AI, or RCCL/NCCL
• Familiar with OpenStack, OpenShift, Azure, or AWS
• Programming experience with Windows and Linux shell scripting
• Strong sense of teamwork and strong communication skills
• Familiarity with Intel/AMD/NVIDIA development toolkits such as CUDA, oneAPI, and ROCm is a plus
• Experience with server/network hardware debugging and troubleshooting is a plus

Physical requirements:
• Able to stand, walk, sit, talk, listen, crouch or crawl, and reach with hands and arms
• Able to lift, carry, push, and pull up to 50 pounds
• Able to work in noisy, cold, and hot environments such as labs, production lines, and data centers

Salary Range

$82,000 - $133,000

The salary offered will depend on several factors, including your location, level, education, training, specific skills, years of experience, and comparison to other employees already in this role. In addition to a comprehensive benefits package, candidates may be eligible for other forms of compensation, such as participation in bonus and equity award programs.

EEO Statement

Supermicro is an Equal Opportunity Employer and embraces diversity in our employee population. It is the policy of Supermicro to provide equal opportunity to all qualified applicants and employees without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, protected veteran status or special disabled veteran, marital status, pregnancy, genetic information, or any other legally protected status.

Date Posted

03/16/2024
