Platform Site Reliability Engineer

IBM · PL

Company

IBM

Location

PL

Type

Full Time

Job Description

Introduction
Seeking new possibilities and always staying curious?
We are a team dedicated to creating the world’s leading AI-powered cloud-native software solutions for our customers. Our renowned legacy creates endless global opportunities for our IBMers, so the door is always open for those who want to grow their career.
A career in IBM Software means you’ll be part of a team that transforms our customers’ challenges into solutions.

Your Role and Responsibilities
As a Site Reliability Engineer, you will build the next generation of software automation and tooling to ensure the reliability and resiliency of our systems. Bringing a unique blend of development experience and skills in both software and systems, you will play a key role in analyzing business needs, identifying and solving problems, advising on and designing solutions, building and testing, deploying and managing changes, and maintaining well-engineered information systems and ecosystems.

Key Responsibilities:

• Identify and investigate issues using troubleshooting techniques in order to provide advice and guidance to clients
• Work in a global team located in the US, Canada, Ireland, China, India, and Australia, collaborating with IBMers to share recommendations, solutions, and ideas
• Look for enhancements and innovative solutions to help the services scale and to improve existing technical support tools, procedures, or processes
• Develop and enhance your technical knowledge via projects and assignments as well as through IBM’s world-class learning platform
• Join the on-duty rotation, including weekend and holiday support on an as-needed basis

Required Technical and Professional Expertise
– Experience in a software development and delivery role
– Experience in Cloud/DevOps engineering and/or Linux administration
– Experience with at least one major public cloud provider or large-scale private/hybrid cloud using container orchestration
– Experience with a modern configuration management framework (Ansible, Chef, Puppet)
– Production experience with one or more monitoring/observability tools (Prometheus, Grafana, Zabbix, etc.)
– Scripting skills in at least one language (Bash, Python, Ruby, etc.)
– Experience with source control management such as Git, Subversion, etc.
– Understanding of software development life cycle and delivery process
– Ability to manage multiple tasks while ensuring that commitments and timetables are met
– Ability to partner with internal stakeholders to design operational solutions
– Ability to work on short timelines, under stress, and on call
– Goal-oriented forward thinker who can provide solutions for complex technical problems
– Fluent in spoken and written English

Preferred Technical and Professional Expertise
– Production Kubernetes/OpenShift experience
– Experience with pipeline tools for deploying and managing applications
– Experience with Prometheus, Grafana, and Loki
– Experience with Infrastructure as Code tools
– Comfortable developing with and analyzing issues in databases (relational and NoSQL)

Date Posted

11/23/2023
