Lead HPC Infrastructure Engineer
Company
EPAM Systems
Location
Rockaway, NJ
Type
Full Time
Job Description
We are seeking an engineer with great knowledge and skills in HPC (High Performance Computing) infrastructure. This position involves daily operations and engineering tasks and requires an individual with a deep-rooted engineering background complemented by extensive deployment and optimization experience. If you are passionate about HPC and equipped with the necessary skills and experience, we look forward to receiving your application!
Unlock the potential of remote work in Kyrgyzstan, giving you the flexibility to work from home or access our office in Bishkek.
#LI-DNI#LI-VA2
Responsibilities
- Participation in all aspects of HPC infrastructure support
- Implementing IaC (Infrastructure as Code) automation
- Contributing to incident resolution in addition to managing software and hardware upgrades
Want more jobs like this?
Get jobs in Rockaway, NJ delivered to your inbox every week.

- Solid experience in the HPC technical domain
- Engineering or Development background
- Mastery in configuration and support of HPC infrastructure
- Proficiency in Linux (any RPM-based), inclusive of kernel modules compilation, debugging tools such as strace, coredump, tcpdump, etc
- Experience in dealing with job schedulers like IBM LSF and Slurm
- Fundamental knowledge of Bright Cluster Manager, GPFS/Lustre filesystems, and InfiniBand/OmniPath network interconnect
- English of at least B1+ and above
- Experience in diagnosing, upgrading, and tuning hardware components like HCA InfiniBand, disk arrays (Lustre, Vast, IBM), and Dell/HP servers
- Expertise in infrastructure monitoring through Zabbix, Splunk, Grafana, etc
- Knowledge of Easybuild
- Prior experience of working in a GxP environment
- Familiarity with Jira and ServiceNow
- We connect like-minded people::
- Delivering innovative solutions to industry leaders, making a global impact
- Enjoyable working environment, whether it is the vibrant office or the comfort of your own home
- Opportunity to work abroad for up to two months per year
- Relocation opportunities within our offices in 55+ countries
- Corporate and social events
- We invest in your growth::
- Leadership development, career advising, soft skills and well-being programs
- Certifications, including GCP, Azure and AWS
- Unlimited access to LinkedIn Learning, Get Abstract, O'Reilly
- Free English classes with certified teachers
- We cover it all::
- Monetary bonuses for engaging in the referral program
- Medical & family care package
- Six trust days per year (sick leave without a medical certificate)
- Coverage of psychology sessions of your choice
- Discounts for fitness clubs and sports programs
- Benefits package (sports activities, a variety of stores and services)
Date Posted
01/23/2025
Views
0
Similar Jobs
Data Leak Protection Analyst - Barclays
Views in the last 30 days - 0
OR for an individual contributor they develop technical expertise in work area acting as an advisor where appropriate
View DetailsBathroom Installer - Premier Home Pros
Views in the last 30 days - 0
Basic knowledge of hand tools and power tools Must have a valid smart phone or smart device in order to receive work orders and update our CRM
View DetailsAWS DevOps Engineer - AllShifts
Views in the last 30 days - 0
The AWS Engineer will be responsible for applying expertise to develop and execute requirements procedures and guidelines for AWS Infrastructure ensur...
View DetailsCybersecurity Analyst - Balchem Corporation
Views in the last 30 days - 0
Familiarity with enterpriseclass detection endpoint protection and vulnerability assessment technologies This role combines security operations monito...
View DetailsWelder I - Core & Main
Views in the last 30 days - 0
Are you able to read CAD or hand drawings to calculate and accurately measure cut fitup and align piping for the welding of various configurations
View DetailsRadar Software Engineer - In-Depth Engineering Corporation
Views in the last 30 days - 0
Perform in an agile fast paced environment applying advanced technologies software architecture design verification validation scientific principles a...
View Details