Staff AI Infrastructure Engineer
Job Description
XPeng Motors is one of Chinaβs leading smart electric vehicle (EV) companies. We design, develop, and manufacture smart EVs that are seamlessly integrated with advanced Internet, AI and autonomous driving technologies. We are committed to in-house R&D and intelligent manufacturing to create a better mobility experience for our customers. We strive to transform smart electric vehicles with technology and data, shaping the mobility experience of the future.
Β
We are looking for a talented AI/ML Infrastructure Engineer to join our team. In this role, you will have the opportunity to improve productivity for our researchers by enhancing the entire stack. Your primary duty will be to identify and resolve infrastructure gaps to provide reliable, efficient, and scalable solutions.
Β
Job Responsibilities:
-
Identify and resolve infrastructure gaps to ensure reliable, efficient, and scalable solutions
-
Develop advanced AI/ML infrastructure solutions that enhance the efficiency of our skilled ML teams
-
Design and implement solutions for critical areas, including distributed storage systems, scheduling systems, high availability capabilities, and core reliability issues within our large-scale GPU clusters
-
Monitor and optimize the performance of our AI/ML infrastructure, ensuring high availability, scalability, and efficient resource utilization
-
Develop and deploy automation tools, monitoring solutions, and operational strategies to streamline infrastructure management and reduce manual tasks
-
Work with various teams, including ML developers, data engineers, and DevOps professionals, to create a cohesive and integrated AI/ML infrastructure ecosystem
Minimum Skill Requirements:
-
Bachelor's degree in Computer Science, Engineering, or related technical field
-
5-8+ years of experience in software engineering, with a strong background in developing and managing large-scale distributed systems, ideally within the AI/ML infrastructure domain
-
Proficiency in programming languages such as Python, Go, or C++, with knowledge of cloud computing platforms like AWS, Azure, etc.
-
Strong communication and collaboration abilities, effective in working with diverse teams and individuals
Β
Preferred Skill Requirements:
-
In-depth understanding of AI/ML workflows, including model training, data processing, and inference pipelines
-
Practical experience with containerization technologies (i.e., Docker, Kubernetes), automation tools (i.e., Ansible, Terraform), and monitoring solutions (i.e., Prometheus, Grafana)
-
Exceptional problem-solving skills, capable of analyzing complex systems, identifying bottlenecks, and implementing scalable solutions
-
A passion for continuous learning and staying abreast of new technologies and best practices in the AI/ML infrastructure space
What do we provide:
-
A fun, supportive and engaging environment
-
Opportunity to make significant impact on the transportation revolution by the means of advancing autonomous driving
-
Opportunity to work on cutting edge technologies with the top talent in the field
-
Competitive compensation package
-
Snacks, lunches and fun activities
Β
The base salary range for this full-time position is $180,000-$300,000, in addition to bonus, equity and benefits. Our salary ranges are determined by role, level, and location. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position across all US locations. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training.
Β
We are an Equal Opportunity Employer. It is our policy to provide equal employment opportunities to all qualified persons without regard to race, age, color, sex, sexual orientation, religion, national origin, disability, veteran status or marital status or any other prescribed category set forth in federal or state regulations.
Explore More
Date Posted
07/10/2024
Views
0
Similar Jobs
Staff Flight Test Engineer - Wisk
Views in the last 30 days - 0
Wisk Aero is seeking a Staff Flight Test Engineer to join their team in Hollister CA The role involves ensuring safe and efficient flight testing and ...
View DetailsStaff Engineer, System Design Verification Engineering - Western Digital
Views in the last 30 days - 0
Western Digital is seeking a validation engineer to define and track test plans characterize and optimize SSDs and lead bug review meetings The ideal ...
View DetailsSenior Developer, Data Engineer - Tarana Wireless, Inc.
Views in the last 30 days - 0
Tarana is seeking a Senior DeveloperData Engineer with 5 years of experience in building largescale data pipelines The role involves designing buildin...
View DetailsServo Development Engineer - Western Digital
Views in the last 30 days - 0
Western Digital a company with over 50 years of experience in data storage is seeking a skilled professional to optimize highperformance and robust po...
View DetailsSenior Front-End Software Engineer - Percipient.ai
Views in the last 30 days - 0
Percipientai founded in 2017 is a cuttingedge technology company specializing in Computer Vision Artificial Intelligence and Deep Learning They develo...
View DetailsPrincipal Software Engineer (Prisma Access) - Palo Alto Networks
Views in the last 30 days - 0
Palo Alto Networks is a cybersecurity company committed to protecting the digital way of life They are seeking a Principal Software Engineer to build ...
View Details