HPC Engineer

Location

Remote

Type

Full Time

Job Description


Austin, TX, USA
Hybrid
130K-176K Annually
Mid level
Artificial Intelligence • Internet of Things • Semiconductor
Build What the World Depends On
The Role
The HPC Engineer will operate and enhance Arm's HPC platforms, focusing on reliability, automation, and user experience, while collaborating with engineering and infrastructure teams.
Job Overview
Engineering IT provides the high-performance compute platforms that enable Arm's engineering teams to design, verify, and deliver world-class products. The team operates a mix of on-premises and cloud-based HPC environments, EDA enablement services, job scheduling platforms, automation tooling, and custom workflows that are critical to engineering productivity across Arm.
We are looking for an HPC Operations Engineer to help run, improve, and modernize these services. This role combines production operations, site reliability engineering, automation, cloud integration, and close collaboration with engineering users and infrastructure teams.
Responsibilities
  • Operate, support, and continuously improve Arm's HPC platforms, with a solid focus on IBM Spectrum LSF and related job scheduling services.
  • Improve reliability, scalability, performance, and operational efficiency through automation, observability, standardization, and SRE practices.
  • Develop automation and self-service capabilities to reduce manual operational effort and improve the user experience.
  • Support production HPC environments, including incident response, troubleshooting, root cause analysis, service restoration, and continuous improvement.
  • Work directly with engineering users to improve job scheduling behavior, workload performance, resource utilization, and platform efficiency.
  • Develop and maintain scripts, tools, and automation frameworks using Python, Bash, and related technologies.
  • Support modernization initiatives involving containers, Kubernetes, Docker, cloud-native services, Infrastructure as Code, and alternative scheduling or orchestration technologies.
  • Contribute to cloud HPC integration across AWS, GCP, Azure, OpenStack, and hybrid environments.
  • Collaborate with platform, cloud, storage, infrastructure, networking, and security teams to deliver robust engineering services.
  • Contribute to project delivery by working with technical leads, architects, project managers, and operational team members.
  • Help define and promote standards for DevOps, SRE, platform engineering, CI/CD, monitoring, and infrastructure automation.

Required Skills and Experience
  • Experience operating HPC environments and job schedulers such as IBM Spectrum LSF, Slurm, PBS, Grid Engine, or similar.
  • Strong Linux system administration experience, preferably with RHEL or RHEL-based distributions.
  • Good scripting and automation skills using Python, Bash, Shell, or similar languages.
  • Experience supporting production infrastructure, including incident management, troubleshooting, operational recovery, and root cause analysis (RCA), or comparable experience.
  • Familiarity with monitoring, alerting, and observability platforms such as Dynatrace, Prometheus, Grafana, or similar.
  • Experience building, maintaining, or supporting CI/CD pipelines and automation frameworks.
  • Experience with public, private, or hybrid cloud platforms, including AWS, GCP, Azure, OpenStack, and Kubernetes-based services.
  • Understanding of DevOps, SRE, platform engineering, infrastructure automation, and operational excellence principles.
  • Familiarity with Agile delivery practices and collaboration tools such as Jira and Confluence.
  • Ability to work with engineering users, understand workload requirements, and translate operational issues into practical improvements.

Desirable Experience
  • Experience working in EDA or semiconductor engineering environments.
  • Familiarity with EDA tools, license-aware scheduling, large-scale batch workloads, and engineering compute workflows.
  • Exposure to container platforms and orchestration technologies such as Docker, Kubernetes, and Kubernetes-native scheduling.
  • Experience with Infrastructure as Code tools such as Terraform and Ansible.
  • Exposure to alternative schedulers such as Slurm or cloud-native workload orchestration systems.
  • Experience using AI-assisted tooling, MCP, agentic services, or automation agents to improve diagnostics, operations, optimization, or self-service support.
  • Experience operating large-scale distributed systems across both on-premises and cloud infrastructure.

Salary Range:
$130,100-$176,000 per year
We value people as individuals and our dedication is to reward people competitively and equitably for the work they do and the skills and experience they bring to Arm. Salary is only one component of Arm's offering. The total reward package will be shared with candidates during the recruitment and selection process.
Accommodations at Arm
At Arm, we want to build extraordinary teams. If you need an adjustment or an accommodation during the recruitment process, please email [email protected]. Please note that by sending us the requested information, you consent to its use by Arm to arrange for appropriate accommodations. All accommodation or adjustment requests will be treated with confidentiality, and information concerning these requests will only be disclosed as necessary to provide the accommodation. Although this is not an exhaustive list, examples of support include breaks between interviews, having documents read aloud, or office accessibility. Please email us about anything we can do to accommodate you during the recruitment process.
Hybrid Working at Arm
Arm's approach to hybrid working is designed to create a working environment that supports both high performance and personal wellbeing. We believe in bringing people together face to face to enable us to work at pace, whilst recognizing the value of flexibility. Within that framework, we empower groups/teams to determine their own hybrid working patterns, depending on the work and the team's needs. Details of what this means for each role will be shared upon application. In some cases, the flexibility we can offer is limited by local legal, regulatory, tax, or other considerations, and where this is the case, we will collaborate with you to find the best solution. Please talk to us to find out more about what this could look like for you.
Equal Opportunities at Arm
Arm is an equal opportunity employer, committed to providing an environment of mutual respect where equal opportunities are available to all applicants and colleagues. We are a diverse organization of dedicated and innovative individuals, and don't discriminate on the basis of race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.

Skills Required

  • Experience operating HPC environments and job schedulers such as IBM Spectrum LSF, Slurm, PBS, or Grid Engine
  • Strong Linux system administration experience, preferably with RHEL
  • Good scripting and automation skills using Python, Bash, Shell, or similar
  • Experience supporting production infrastructure and incident management
  • Familiarity with monitoring and observability platforms such as Dynatrace, Prometheus, and Grafana
  • Experience building and maintaining CI/CD pipelines and automation frameworks
  • Experience with public, private, or hybrid cloud platforms, including AWS, GCP, and Azure
  • Understanding of DevOps, SRE, platform engineering, and infrastructure automation principles
  • Familiarity with Agile delivery practices and collaboration tools such as Jira and Confluence
  • Ability to work with engineering users and understand workload requirements
The Company
HQ: Cambridge, England
8,314 Employees
Year Founded: 1990

What We Do

We bring brilliant people together in a global ecosystem that is sparking the world’s potential. Arm technology enables specialized processing built on the economics, design freedom, and accessibility of general-purpose compute, which has so far led to more than 180 billion chips being shipped by our partners.

Why Work With Us

At Arm, we build the future of computing, powering everything from smartphones to AI. Our 10x mindset drives bold thinking and deep collaboration to solve complex problems together. With a people-first culture, flexible work, and strong support for growth and wellbeing, your ideas can make a global impact while your career thrives.

Arm Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

Typical time on-site: Not Specified
HQ: Cambridge, UK
Galway, Ireland
Budapest, Hungary
Sophia Antipolis, France
Ra'anana, Israel
Bengaluru, India
Noida, India
Yokohama, Japan
Seoul, South Korea
Hsinchu, Taiwan
Taipei, Taiwan
Munich, Germany
Austin, TX
Bristol, UK
Chandler, AZ
Raleigh, NC
Lund, Sweden
Manchester, England
Oslo, Norway
San Diego, CA
San Jose, CA
Sheffield, UK
Trondheim, Norway
Boston, MA

Date Posted

06/07/2026
