Job Description
Overview
In this role you will be part of a team that develops and supports the Apptio Kubernetes Platform (AKP)
where all Apptio applications are deployed. In a typical day you will interact with Github Linux
Kubernetes ArgoCD Docker Confluence Jira Slack and AWS.
You Are
You are passionate about problem solving and reliability and have experience in SRE or an adjacent role.
Your team can count on you to solve challenging problems across the entire Apptio Portfolio. You
collaborate with other SREs developers and support teams to help provide value to the broader
organization. You take responsibility when fixing problems in an automated code first way and are happy
to step outside your comfort zone to develop your skillset.
You Aren’t
A Kubernetes or cloud expert with many years of experience. This is an intermediate position; we want
you to help us and we also want to help you grow.
Us
The Platform and Site Reliability Engineering team – PRE – at Apptio is responsible for enhancing and
maintaining our Kubernetes platform and driving the adoption of SRE best practices across our
engineering teams. We are a distributed team working across three locations including the United States Poland and Australia.
Your Role and Responsibilities
• Manage deployments of Apptio services to AKP
• Streamline the deployment process
• Improve observability of the services within your purview by reviewing KPI dashboards and
alerting
• Author and maintain documentation of deployment and monitoring processes
• Use runbooks to troubleshoot and triage production issues
• Detect issues and handle Tier 1-2 troubleshooting
• Participate in online “swarm” collaboration sessions
• Collaborate with service developers
• Participate in on-call rotation
• Perform maintenance of the platform (patching resets upgrades etc.)
Required Technical and Professional Expertise
• 1+ years’ experience in an SRE or adjacent role
• Foundational understanding of at least one programming language and source control
(Preferably Golang)
• Practical experience with distributed application deployment and management
• Practical experience with container technologies (e.g. Kubernetes Docker)
• Practical experience with Infrastructure-as-code (IaC) – Terraform Cloud Formation Ansible
etc
• Experience with cloud provider services such as AWS Azure or Google Cloud Platform
• Familiarity with RESTful systems and their APIs
• Demonstrated fluency with the English language
Preferred Technical and Professional Expertise
• 2+ years’ experience in an SRE or adjacent role
• Familiarity with Apptio and IBM product offerings
Date Posted
11/07/2024
Views
0
Similar Jobs
Senior Site Reliability Engineer - IBM
Views in the last 30 days - 0
The role is for a Site Reliability Engineer to develop and support the Apptio Kubernetes Platform requiring experience in SRE problemsolving and colla...
View DetailsSite Reliability Engineer II - Apptio - IBM
Views in the last 30 days - 0
The job description is for a Site Reliability Engineer at Apptio Targetprocess The role involves ensuring the companys infrastructure and applications...
View DetailsAI/ML Staff Software Development Engineer - Apptio - IBM
Views in the last 30 days - 0
The job posting is for a Staff AIMLOps Development Engineer at Apptio responsible for designing and engineering efficient and resilient MLOps platform...
View DetailsStaff Backend Software Development Engineer - Apptio - IBM
Views in the last 30 days - 0
The job posting is looking for a seasoned software engineer with experience in building scalable microservices and handling massive amounts of data Th...
View DetailsApptio - Software Development Engineer I - IBM
Views in the last 30 days - 0
The text describes a job opportunity at IBM highlighting the companys focus on innovation collaboration and delivering elegant solutions to complex bu...
View DetailsSenior Software Development Engineer, Apptio - IBM
Views in the last 30 days - 0
The job posting is for a senior software engineer position at Apptio an IBM company The role involves working on a highperforming crossfunctional team...
View Details