Site Reliability Engineer

IBM · IN Hyderabad

Company

IBM

Location

IN Hyderabad

Type

Full Time

Job Description

Introduction
At IBM work is more than a job โ€“ itโ€™s a calling: To build. To design. To code. To consult. To think along with clients and sell. To make markets. To invent. To collaborate. Not just to do something better but to attempt things youโ€™ve never thought possible. Are you ready to lead in this new era of technology and solve some of the worldโ€™s most challenging problems? If so letโ€™s talk.

Your Role and Responsibilities
Looking for 3+ years of experience candidate with the following experience

Your Role and Responsibilities

  • Monitoring the health of the IKS control plane and ensuring reliable operations
  • Responding promptly to production issues and alerts
  • Executing changes in the production environment through advanced automation
  • Partnering with other SRE teams and program managers to deliver mission-critical services
  • Supporting the development and enhancement of Platform-as-a-Service services
  • Implementing and automating solutions that support IBM Cloud products
  • Ensuring compliance and security integrity of the environment
  • Collaborating with Engineering to troubleshoot and resolve production issues
  • Providing technical escalation support for other Infrastructure Operations teams


Required Technical and Professional Expertise

  • Expertise in Kubernetes architecture including the latest features and security aspects
  • Strong debugging skills in Kubernetes environments.
  • Strong experience in programming with Python or Go with demonstrated ability to develop and maintain complex codebases.
  • Proficiency in network configuration and advanced monitoring solutions such as Prometheus SysDIG and Grafana
  • Experience in hands-on administration of cloud infrastructure particularly Kubernetes-based platforms.
  • Skills in performance tuning and optimization of Kubernetes clusters including resource quota management scaling and efficient use of underlying infrastructure.
  • Understanding of network protocols (TCP/IP HTTP etc.) and network configuration tools (e.g. CNI) specific to Kubernetes environments.
  • Deep understanding of Kubernetes security practices including network policies security contexts role-based access control (RBAC) and the secure handling of secrets.
  • Knowledge of automation and configuration management tools: Ansible Salt ChefTerraform
  • Strong Linux skills for managing services across a microservices platform
  • Ability to implement robust incident management strategies and frameworks
  • Experience in performance optimization of Kubernetes clusters
  • Understanding of disaster recovery planning and high availability setups in Kubernetes environments
  • Excellent written and verbal communication skills with a willingness to take on call-out responsibilities
  • Experience establishing and improving procedures within a mission-critical environment


Preferred Technical and Professional Expertise

  • Hands-on experience with any one of cloud infrastructures (IKS AWS Azure GCP) and integrating cloud services for storage security and databases
  • Knowledge of Slack bot automations for infra/cloud maintenance and SRE-based automations
  • Active participation in Kubernetes communities and forums
  • Vendor management skills to ensure optimal service levels and cost control
  • Ability to mentor and train teams on Kubernetes best practices and operational strategies
Apply Now

Date Posted

11/19/2024

Views

0

Back to Job Listings Add To Job List Company Profile View Company Reviews
Positive
Subjectivity Score: 0.8

Similar Jobs

High Speed IO Verification Engineer - IBM

Views in the last 30 days - 0

The High Speed IO design team is seeking a professional with experience in design verification particularly for IBM POWER systems and Z Mainframes pro...

View Details

Data Engineer: Enterprise Content Management - IBM

Views in the last 30 days - 0

This job posting is for a Data Engineer role at IBM Consulting The role involves harnessing the power of data to unveil captivating stories and intric...

View Details

Data Engineer: Data Platforms-AWS - IBM

Views in the last 30 days - 0

The role involves working in IBM Consulting Client Innovation Centers delivering technical expertise to clients and developing big data solutions The ...

View Details

Data Engineer: Data Platforms-AWS - IBM

Views in the last 30 days - 0

The role of Big Data Engineer in IBM Consulting involves developing and maintaining big data solutions working with clients to improve their hybrid cl...

View Details

Data Engineer: Business Intelligence - IBM

Views in the last 30 days - 0

The job description is for a Cognos Developer and Administrator to work in an IBM Consulting Client Innovation Center The role involves providing expe...

View Details

Full Stack Software Developer - IBM

Views in the last 30 days - 0

The text describes a job opening for a skilled backend developer in IBM Softwares Cloud Platform Services team The role involves designing developing ...

View Details