We are looking for a dynamic Site Reliability Engineer to join our Cloud PaaS Operations Team in Dublin, Ireland, someone who is responsive to market needs and can deliver value to our clients in a fast-changing cloud landscape. The SRE team is dedicated to ensuring that the IBM Cloud is at the forefront of cloud technology, from data center design, storage and network architecture, and compute clusters to flexible infrastructure services. We are building IBM's next-generation cloud platform to deliver performance and predictability for our customers' most demanding workloads at global scale and with leadership efficiency, resiliency, and security. It is an exciting time, and as a team we are driven by this incredible opportunity to thrill our clients.
Software Developers at IBM are the backbone of our strategic initiatives to design, code, test, and provide industry-leading solutions that make the world run today. At IBM, you will use the latest software development tools, techniques, and approaches and work with leading minds in the industry to build solutions you can be proud of.
Are you passionate about technology? Do you love building new things? Do you want to develop the future of IBM's Cloud offerings? If you answered YES, then we have the right opportunity for you!
The shift toward the consumption of IT as a service (i.e., the cloud) is one of the most important changes to happen to our industry in decades. At IBM, we are driven to shift our technology to an as-a-service model and to help our clients transform themselves to take full advantage of the cloud. With industry leadership in analytics, security, commerce, and cognitive computing, and with unmatched hardware and software design and enterprise reach, no other company is as well positioned to address the full opportunity of cloud computing.
In this Event Streams Site Reliability Engineer role, you will work closely with several data centers, the entire Cloud organization, and IBM vendors to support, maintain, and operationally improve the IBM Cloud infrastructure. You will focus on the following key responsibilities:
1+ year of SRE or DevOps experience
Cloud Environment (Kubernetes, OpenShift, VPC)
Golang (preferred language) or Java
Kafka
Terraform/Ansible
CI/CD (Jenkins/GitHub)
· Good written and verbal communication skills
· Experience in hands-on production administration of large systems and environments
· Experience establishing and improving procedures within a mission-critical environment
· Must be comfortable writing and debugging scripts
· Must be comfortable using and navigating within a Linux environment
· Ability to debug and analyze problems by examining logs and running Unix commands
· Experience with monitoring technologies, automation/configuration, and PagerDuty rota/on-call
· Working knowledge of ServiceNow and GitHub