Hardware Developer

IBM · US Poughkeepsie

Company

IBM

Location

US Poughkeepsie

Type

Full Time

Job Description

Introduction
At IBM work is more than a job – it’s a calling: To build. To design. To code. To consult. To think along with clients and sell. To make markets. To invent. To collaborate. Not just to do something better but to attempt things you’ve never thought possible. Are you ready to lead in this new era of technology and solve some of the world’s most challenging problems? If so lets talk.

Your Role and Responsibilities
In this role you will be completely responsible for all aspects of hardware bringup test delivery and repair for a subset of our hardware focusing on GPU systems. This includes running all initial bringup steps as well as configuration checks to ensure the racks and servers were built correctly and that all expected components are recognized. It also includes performing stress testing of CPU GPU memory storage network.

In this role you must use their existing experience in GPU based servers to diagnose all problems and provide actions plans to datacenter personnel follow up on actions and retest after repairs are completed. Excellent documentation and communication skills are required along with Linux and relevant hardware experience. You will be responsible to complete hardware bringup and test before strict deadlines. You will execute automation code from the Linux command line to bringup and test the systems analyze the command line results and logs. You will not be required to write the bringup and test suite but it is expected that if the python code crashes You will attempt to diagnose what line of code is the problem and suggest a fix to the automation team. Python programming experience is not required but in this role you should be willing to learn and be interested to participate with feedback on how to fix and improve the automation. A candidate with excellent hardware and Linux skills who can also perform object oriented python programming would be ideal.

In addition to the primary responsibility of bringup and test of hardware and repair of hardware you will be expected to engage in hands on activity with a reasonably small quantity of relevant GPU systems in the IBM Poughkeepsie lab to maintain and improve their expertise with the latest hardware and to assist in qualification activity for new hardware. This includes hands on work joint work on firmware and BIOS with the qualification team and joint work with the automation team. Security fixes to firmware is another aspect to this job and in this role you are expected to oversee and help drive qualification activities for new firmware and to be responsible to upgrade firmware on systems in production using the automation suite.

NAIWD24


Required Technical and Professional Expertise

  • You need to be very detail oriented with excellent documentation and communication skills.
  • You must have basic layer 2 and linux networking skills including ARP and DHCP are expected; deeper network skills would be extremely useful as well.
  • You need to be capable of executing bringup and diagnostic test code from the linux command line interpret results and error messages as they apply to the hardware and inspect log files.
  • You need to be able to interact with on site datacenter personnel to provide repair instructions.
  • You need to be able to rapidly stand up new racks of server hardware ensuring it is free of defects and is ready for customer consumption and also diagnose field failures quickly drive action plans for repair with the data center personnel and get the repaired servers back into production for customer use.
  • You must have familiarity with GPU based systems and be familiar with server hardware subsystems including proc mem storage io adapters PCI busses and switches BMCs as well as skills required for server bringup including fw/bios updates bios settings network booting.
  • Knowledge of security vulnerabilities and patching of systems would be useful.
  • Ideally you must be capable of object oriented python programming in order to not only use but also contribute to the hardware test suite for bringup and diagnosis of GPU based systems.


Preferred Technical and Professional Expertise

  • Capability to design program read and debug in object oriented python code.
  • Knowledge of JIRA Redfish xCAT images switches BGP/ECMP are a plus but not required.
  • Knowledge of security vulnerabilities and patching of systems would be useful.
Apply Now

Date Posted

03/22/2024

Views

15

Back to Job Listings Add To Job List Company Profile View Company Reviews
Positive
Subjectivity Score: 0.8

Similar Jobs

Hardware Developer - Poughkeepsie - IBM

Views in the last 30 days - 0

The job role is thermal hardware developer responsible for creating thermal simulations and presenting results to system architects and management The...

View Details

Software Developer (Poughkeepsie, NY) - IBM

Views in the last 30 days - 0

IBM is seeking a highly motivated software developer to join the dynamic and innovative IBM AI on Z team The team is responsible for expanding the sup...

View Details

Entry level Software Developer - IBM

Views in the last 30 days - 0

The text describes a job opening for a Mainframe Tester position at IBM Z systems The candidate will be responsible for writing test cases testing cod...

View Details

Software Developer - IBM

Views in the last 30 days - 0

IBM is seeking a software developer to join the AI on Z team developing solutions for AI efforts on the Z platform working with top minds in AI and ML...

View Details

Back-End Developer - Entry level - IBM

Views in the last 30 days - 0

The job role is for a programmerengineer to work on the System Assurance Kernel SAK team designing coding testing and debugging architecture verificat...

View Details

QA/Test Developer - Entry level - IBM

Views in the last 30 days - 0

The job posting is for a QA tester to join the zOS Customer Test group The role involves designing and automating test suites having a clientfirst min...

View Details