Site Reliability Engineer

IBM · IN Bangalore

Type

Full Time

Job Description

Introduction
A career in IBM Cloud means you’ll be part of a team that transforms our customers’ challenges into solutions.
Seeking new possibilities and always staying curious, we are a team dedicated to creating the world’s leading AI-powered, cloud-native software solutions for our customers. Our renowned legacy creates endless global opportunities for our IBMers, so the door is always open for those who want to grow their career.
IBM’s product and technology landscape includes Research, Software, and Infrastructure. Entering this domain positions you at the heart of IBM, where growth and innovation thrive.

Your Role and Responsibilities
As a Site Reliability Engineer, you will work in an agile, collaborative environment to build, deploy, configure, and maintain systems for the IBM client business. In this role, you will lead the problem resolution process for our clients, from analysis and troubleshooting to deploying workarounds or fixes.

Your primary responsibilities include:

  • Monitoring the health of the IBM Cloud Kubernetes Service (IKS) control plane and ensuring reliable operations
  • Responding promptly to production issues and alerts
  • Executing changes in the production environment through advanced automation
  • Partnering with other SRE teams and program managers to deliver mission-critical services
  • Supporting the development and enhancement of Platform-as-a-Service services
  • Implementing and automating solutions that support IBM Cloud products
  • Ensuring compliance and security integrity of the environment
  • Collaborating with Engineering to troubleshoot and resolve production issues
  • Providing technical escalation support for other Infrastructure Operations teams


Required Technical and Professional Expertise

  • 2+ years of IT experience
  • Expertise in Kubernetes architecture, including the latest features and security aspects
  • Strong debugging skills in Kubernetes environments
  • Strong programming experience in Python or Go, with a demonstrated ability to develop and maintain complex codebases
  • Proficiency in network configuration and advanced monitoring solutions such as Prometheus, Sysdig, and Grafana
  • Hands-on experience administering cloud infrastructure, particularly Kubernetes-based platforms
  • Skills in performance tuning and optimization of Kubernetes clusters, including resource quota management, scaling, and efficient use of underlying infrastructure
  • Understanding of network protocols (TCP/IP, HTTP, etc.) and network configuration tools (e.g., CNI) specific to Kubernetes environments
  • Deep understanding of Kubernetes security practices, including network policies, security contexts, role-based access control (RBAC), and the secure handling of secrets
  • Knowledge of automation and configuration management tools: Ansible, Salt, Chef, Terraform
  • Strong Linux skills for managing services across a microservices platform
  • Ability to implement robust incident management strategies and frameworks
  • Experience in performance optimization of Kubernetes clusters
  • Understanding of disaster recovery planning and high availability setups in Kubernetes environments
  • Excellent written and verbal communication skills with a willingness to take on call-out responsibilities
  • Experience establishing and improving procedures within a mission-critical environment


Preferred Technical and Professional Expertise

  • Hands-on experience with any one cloud infrastructure (IKS, AWS, Azure, GCP) and with integrating cloud services for storage, security, and databases
  • Knowledge of Slack bot automations for infra/cloud maintenance and SRE-based automations
  • Active participation in Kubernetes communities and forums
  • Vendor management skills to ensure optimal service levels and cost control
  • Ability to mentor and train teams on Kubernetes best practices and operational strategies

Date Posted

06/24/2024
