Lead HPC Infrastructure Engineer

EPAM Systems Rockaway, NJ

Company

EPAM Systems

Location

Rockaway, NJ

Type

Full Time

Job Description

We are seeking an engineer with great knowledge and skills in HPC (High Performance Computing) infrastructure. This position involves daily operations and engineering tasks and requires an individual with a deep-rooted engineering background complemented by extensive deployment and optimization experience. If you are passionate about HPC and equipped with the necessary skills and experience, we look forward to receiving your application!
Unlock the potential of remote work in Kyrgyzstan, giving you the flexibility to work from home or access our office in Bishkek.

#LI-DNI#LI-VA2

Responsibilities

  • Participation in all aspects of HPC infrastructure support
  • Implementing IaC (Infrastructure as Code) automation
  • Contributing to incident resolution in addition to managing software and hardware upgrades
Requirements

Want more jobs like this?

Get jobs in Rockaway, NJ delivered to your inbox every week.

By signing up, you agree to our Terms of Service & Privacy Policy.
  • Solid experience in the HPC technical domain
  • Engineering or Development background
  • Mastery in configuration and support of HPC infrastructure
  • Proficiency in Linux (any RPM-based), inclusive of kernel modules compilation, debugging tools such as strace, coredump, tcpdump, etc
  • Experience in dealing with job schedulers like IBM LSF and Slurm
  • Fundamental knowledge of Bright Cluster Manager, GPFS/Lustre filesystems, and InfiniBand/OmniPath network interconnect
  • English of at least B1+ and above
Nice to have
  • Experience in diagnosing, upgrading, and tuning hardware components like HCA InfiniBand, disk arrays (Lustre, Vast, IBM), and Dell/HP servers
  • Expertise in infrastructure monitoring through Zabbix, Splunk, Grafana, etc
  • Knowledge of Easybuild
  • Prior experience of working in a GxP environment
  • Familiarity with Jira and ServiceNow
We offer
  • We connect like-minded people::
    • Delivering innovative solutions to industry leaders, making a global impact
    • Enjoyable working environment, whether it is the vibrant office or the comfort of your own home
    • Opportunity to work abroad for up to two months per year
    • Relocation opportunities within our offices in 55+ countries
    • Corporate and social events
  • We invest in your growth::
    • Leadership development, career advising, soft skills and well-being programs
    • Certifications, including GCP, Azure and AWS
    • Unlimited access to LinkedIn Learning, Get Abstract, O'Reilly
    • Free English classes with certified teachers
  • We cover it all::
    • Monetary bonuses for engaging in the referral program
    • Medical & family care package
    • Six trust days per year (sick leave without a medical certificate)
    • Coverage of psychology sessions of your choice
    • Discounts for fitness clubs and sports programs
    • Benefits package (sports activities, a variety of stores and services)
EPAM Kyrgyzstan is a team of technologists and innovators united by a passion for technology. In 2022, we opened our first office in Bishkek that works with the world's leading companies across many different industries. EPAM builds a continuously learning organization and helps its employees reach their full potential and achieve their professional goals through learning. Our agile methodologies, client collaboration frameworks, engineering excellence programs, and hybrid teams offer many career paths and development opportunities.

Apply Now

Date Posted

01/23/2025

Views

0

Back to Job Listings ❤️Add To Job List Company Info View Company Reviews
Neutral
Subjectivity Score: 0

Similar Jobs

Data Leak Protection Analyst - Barclays

Views in the last 30 days - 0

OR for an individual contributor they develop technical expertise in work area acting as an advisor where appropriate

View Details

Bathroom Installer - Premier Home Pros

Views in the last 30 days - 0

Basic knowledge of hand tools and power tools Must have a valid smart phone or smart device in order to receive work orders and update our CRM

View Details

AWS DevOps Engineer - AllShifts

Views in the last 30 days - 0

The AWS Engineer will be responsible for applying expertise to develop and execute requirements procedures and guidelines for AWS Infrastructure ensur...

View Details

Cybersecurity Analyst - Balchem Corporation

Views in the last 30 days - 0

Familiarity with enterpriseclass detection endpoint protection and vulnerability assessment technologies This role combines security operations monito...

View Details

Welder I - Core & Main

Views in the last 30 days - 0

Are you able to read CAD or hand drawings to calculate and accurately measure cut fitup and align piping for the welding of various configurations

View Details

Radar Software Engineer - In-Depth Engineering Corporation

Views in the last 30 days - 0

Perform in an agile fast paced environment applying advanced technologies software architecture design verification validation scientific principles a...

View Details