Application Site Reliability Engineer - Data&AI
Job Description
Seeking new possibilities and always staying curious? We are a team dedicated to creating
the worldβs leading AI-powered cloud-native software solutions for our customers. Our
renowned legacy creates endless global opportunities for our IBMers so the door is
always open for those who want to grow their career.
A career in IBM Software means youβll be part of a team that transforms our customers
challenges into solutions.
Your Role and Responsibilities
As a Site Reliability Engineer you will specialize in ensuring the reliability and resiliency of our systems. Bringing a unique blend of knowledge and skills in both software and systems you will play a key role in analyzing business needs identifying and solving problems advising and designing solutions building and testing deploying and managing changes and maintaining well-engineered information systems and ecosystems.
Key Responsibilities:
- Reliability and Resilience
- Specialize in ensuring the reliability and resiliency of systems fostering a high-availability environment.
- Problem Analysis and Resolution
- Analyze business needs and proactively identify and solve problems to enhance system performance and stability.
- End-to-End Engineering:
- Play a pivotal role in advising designing building testing deploying and maintaining well-engineered information systems.
Required Technical and Professional Expertise
β Experience in a software development and delivery role
β Experience in Cloud/DevOps engineering and/or Linux administration
β Experience with at least one major public cloud provider or large scale private/hybrid cloud using container orchestration
β Experience with a modern configuration management framework (Ansible Chef Puppet)
β Production experience with one or more monitoring/observability tools (Prometheus Grafana Zabbix etc.)
β Scripting skills in at least one language (BASH Python Ruby etc.)
β Experience with source control management (git subversion etc.)
β Understanding of software development life cycle and delivery process
β Ability to manage multiple tasks while ensuring that commitments and timetables are met
β Ability to partner with internal stakeholders to design operational solutions
β Ability to work with short timeline and under stress and oncall
β Goal oriented forward thinker that can provide solutions for complex technical problems
Preferred Technical and Professional Expertise
β Production Kubernetes/OpenShift experience preferred
β Experience with pipeline tools for deploying and managing applications
β Experience with Prometheus Grafana Loki
β Experience with Infrastructure as Code tools
β Comfortable developing with and analyzing issues in databases (relational and nosql)
Explore More
Date Posted
11/24/2023
Views
0
Similar Jobs
Site Reliability Engineer - IBM
Views in the last 30 days - 0
The job posting is for a Site Reliability Engineer SRE at IBM responsible for ensuring the reliability and scalability of systems and services The rol...
View DetailsSenior Software Engineer - Backend/Java - IBM
Views in the last 30 days - 0
The text describes a role as a Software Engineer for IBM Infrastructure focusing on data integration capabilities and building scalable highperformanc...
View DetailsSenior Machine Learning Engineer - IBM
Views in the last 30 days - 0
WatsonX Orders is an IBM Silicon Valley based technology development group focusing on conversational AI for the quick service restaurant environment ...
View DetailsInfrastructure Security Engineer, WatsonX Orders – ML - IBM
Views in the last 30 days - 0
The job posting is for a skilled Security Engineer to secure infrastructure and applications across AWS k8s and edge locations for an MLpowered AI com...
View DetailsSoftware Engineer (AI) - IBM
Views in the last 30 days - 0
IBM is seeking a Software Engineer with experience in Python Machine Learning and AI to work on the IBM Watson XAI offering from the Kraków Poland off...
View DetailsPlatform Engineer - IBM
Views in the last 30 days - 0
IBM Software is seeking skilled Platform Engineers to develop and maintain cloudnative software solutions The role involves developing scalable distri...
View Details