Job Description
At IBM, work is more than a job – it’s a calling: To build. To design. To code. To consult. To think along with clients and sell. To make markets. To invent. To collaborate. Not just to do something better, but to attempt things you’ve never thought possible. Are you ready to lead in this new era of technology and solve some of the world’s most challenging problems? If so, let’s talk.
Your Role and Responsibilities
- In this Site Reliability Engineer role, you will work closely with several Data Centers, the entire Cloud organization, and IBM vendors to support, maintain, and operationally improve the IBM Cloud infrastructure. You will focus on the following key responsibilities:
- Monitor the health of production and test systems 24×7
- Respond promptly to production issues and alerts 24×7
- Execute changes in the production environment through automation
- Partner with other SRE teams and program managers to deliver mission-critical services to the market
- Support development of new and existing capabilities for our compute, storage, and network infrastructure services
- Implement and automate infrastructure solutions that support IBM Cloud products and infrastructure
- Support the compliance and security integrity of the environment
- Work with Engineering to:
  - Provide an initial assessment of, and possible workarounds for, production issues
  - Troubleshoot and resolve production issues
- Work with Support and Development teams to:
  - Identify and resolve issues
  - Discuss and plan integration tasks
- Provide technical escalation support for other Infrastructure Operations teams
Required Technical and Professional Expertise
- Excellent written and verbal communication skills.
- Willingness to work in shifts or take on-call responsibility for production issues
- 8+ years’ experience in hands-on production administration of large systems and environments
- Experience establishing and improving procedures within a mission-critical environment
- Must be proficient in writing and debugging scripts
- Must be extremely comfortable using and navigating within a Linux environment
- Ability to do low-level debugging and problem analysis by examining logs and running Unix commands
- 5+ years of experience in monitoring technologies, virtualization technologies, and automation / configuration management:
  - Monitoring technologies: Zabbix (preferred), Sysdig, Grafana, Nagios, Splunk, etc. (at least one)
  - Virtualization technologies: Citrix Xen Hypervisor (preferred), KVM (also preferred), libvirt, VMware vSphere, etc. (at least one)
  - Automation and configuration management tools/solutions: Ansible, Salt, Chef, Python, Bash, etc. (at least one)
- Working knowledge of ServiceNow, JIRA, Confluence, and GitHub
- Working knowledge of container technologies: Kubernetes (preferred), Docker, etc.
Preferred Technical and Professional Expertise
- Experience as a cloud infrastructure services network operator, with hands-on experience in any cloud infrastructure and network architecture, whether public cloud, private cloud, hybrid cloud, or multi-cloud
- Solid experience with public cloud platforms and Kubernetes clusters, strong Linux skills for managing services across a microservices platform, good SRE knowledge of cloud compute, storage, and networking, and experience in 24/7 cloud operations and support environments
Date Posted
08/28/2024