Senior Site Reliability Engineer

IBM · US Bellevue

Company

IBM

Location

US Bellevue

Type

Full Time

Job Description

Introduction
Overview
In this role you will be part of a team that develops and supports the Apptio Kubernetes Platform (AKP)
where all Apptio applications are deployed. In a typical day you will interact with Github Linux
Kubernetes ArgoCD Docker Confluence Jira Slack and AWS.

You Are

You are passionate about problem solving and reliability and have significant experience in SRE or an
adjacent role. Your team can count on you to solve challenging problems across the entire Apptio
Portfolio. You collaborate with other SREs developers and support teams to help provide value to the
broader organization. You take responsibility when fixing problems in an automated code first way and are
happy to step outside your comfort zone to develop your skillset. You are a mentor to other engineers and
able to assist Management in key decision making.

Us

The Platform and Site Reliability Engineering team – PRE – at Apptio is responsible for enhancing and
maintaining our Kubernetes platform and driving the adoption of SRE best practices across our
engineering teams. We are a distributed team working across three locations including the United States
Poland and Australia.

Your Role and Responsibilities
β€’ Manage deployments of Apptio services to AKP
β€’ Streamline the deployment process
β€’ Improve observability of the services within your purview by reviewing KPI dashboards and alerting
β€’ Mentor junior to mid-level engineers
β€’ Author and maintain documentation of deployment and monitoring processes
β€’ Write and use runbooks to troubleshoot and triage production issues
β€’ Detect issues and handle Tier 3 troubleshooting
β€’ Drive online β€œswarm” collaboration sessions
β€’ Collaborate with service developers
β€’ Participate in on-call rotation
β€’ Perform maintenance of the platform (patching resets upgrades etc.)
β€’ Operate independently and own end-to-end delivery of solutions
β€’ Have significant input in the product roadmap and be able to articulate effectively the benefits of alternative technologies


Required Technical and Professional Expertise
β€’ 5+ years’ experience in an SRE or adjacent role
β€’ Functional understanding of at least one programming language and source control (Preferably
Golang)
β€’ Expertise with distributed application deployment and management via Kubernetes
β€’ Expertise with container technologies (e.g. Kubernetes Docker)
β€’ Expertise with Infrastructure-as-code (IaC) concepts (Terraform)
β€’ Expertise with cloud provider services preferably AWS
β€’ Ability to work with RESTful systems and their APIs
β€’ Familiarity with observability (e.g. Prometheus Open telemetry)
β€’ Demonstrated fluency with the English language skills


Preferred Technical and Professional Expertise

7+ years’ experience in an SRE or adjacent role
Apply Now

Date Posted

11/07/2024

Views

0

Back to Job Listings Add To Job List Company Profile View Company Reviews
Positive
Subjectivity Score: 0.8

Similar Jobs

Site Reliability Engineer II - IBM

Views in the last 30 days - 0

The role is for a team member to develop and support the Apptio Kubernetes Platform interacting with various tools and collaborating with other teams ...

View Details

Site Reliability Engineer II - Apptio - IBM

Views in the last 30 days - 0

The job description is for a Site Reliability Engineer at Apptio Targetprocess The role involves ensuring the companys infrastructure and applications...

View Details

Senior Software Development Engineer, Apptio - IBM

Views in the last 30 days - 0

The job posting is for a senior software engineer position at Apptio an IBM company The role involves working on a highperforming crossfunctional team...

View Details

AI/ML Staff Software Development Engineer - Apptio - IBM

Views in the last 30 days - 0

The job posting is for a Staff AIMLOps Development Engineer at Apptio responsible for designing and engineering efficient and resilient MLOps platform...

View Details

Staff Backend Software Development Engineer - Apptio - IBM

Views in the last 30 days - 0

The job posting is looking for a seasoned software engineer with experience in building scalable microservices and handling massive amounts of data Th...

View Details

Apptio - Senior Product Manager - IBM

Views in the last 30 days - 0

IBM Cloudability is looking for a product manager to lead the strategy and execution of their commitment automation platform The role requires a custo...

View Details