Job Description
Seeking new possibilities and always staying curious?
We are a team dedicated to creating the worldβs leading AI-powered cloud-native software solutions for our customers. Our renowned legacy creates endless global opportunities for our IBMers so the door is always open for those who want to grow their career.
A career in IBM Software means youβll be part of a team that transforms our customersβ challenges into solutions.
Your Role and Responsibilities
As a Site Reliability Engineer you will build the next generation of software automation and tooling to ensure the reliability and resiliency of our systems. Bringing a unique blend of development experience and skills in both software and systems you will play a key role in analyzing business needs identifying and solving problems advising and designing solutions building and testing deploying and managing changes and maintaining well-engineered information systems and ecosystems.
Key Responsibilities:
β’ Identify and investigate issues using troubleshooting techniques in order to provide advice and guidance to clients
β’ Work in a global team located in US Canada Ireland China India and Australia collaborating with IBMers to share recommendations solutions and ideas
β’ Look for enhancements and innovative solutions to help the services scale and improve existing technical support tools procedures or processes.
β’ Develop and enhance your technical knowledge via projects and assignments as well as through IBMβs world class learning platform
β’ Be on on-duty rotation including weekend and holiday support as needed basis
Required Technical and Professional Expertise
β Experience in a software development and delivery role
β Experience in Cloud/DevOps engineering and/or Linux administration
β Experience with at least one major public cloud provider or large scale private/hybrid cloud using container orchestration
β Experience with a modern configuration management framework (Ansible Chef Puppet)
β Production experience with one or more monitoring/observability tools (Prometheus Grafana Zabbix etc.)
β Scripting skills in at least one language (BASH Python Ruby etc.)
β Experience with source control management such as git subversion etc.
β Understanding of software development life cycle and delivery process
β Ability to manage multiple tasks while ensuring that commitments and timetables are met
β Ability to partner with internal stakeholders to design operational solutions
β Ability to work with short timeline and under stress and oncall
β Goal oriented forward thinker that can provide solutions for complex technical problems
β Fluent in spoken and written English
Preferred Technical and Professional Expertise
β Production Kubernetes/OpenShift experience
β Experience with pipeline tools for deploying and managing applications
β Experience with Prometheus Grafana Loki
β Experience with Infrastructure as Code tools
β Comfortable developing with and analyzing issues in databases (relational and nosql)
Explore More
Date Posted
11/23/2023
Views
0
Similar Jobs
Senior Application Site Reliability Engineer - IBM
Views in the last 30 days - 0
Experience with source control management git subversion etc Experience with log aggregation tools Elastic Loki Mezmo etc Experience in software de...
View DetailsBackend Developer - IBM
Views in the last 30 days - 4
Competitive experience in startups or a fastpaced enterprise environment Aptitude for learning and applying new technologies Prior experience working ...
View Details