A career in IBM Software means you’ll be part of a team that transforms our customers’ challenges into solutions.
Seeking new possibilities and always staying curious, we are a team dedicated to creating the world’s leading AI-powered, cloud-native software solutions for our customers. Our renowned legacy creates endless global opportunities for our IBMers, so the door is always open for those who want to grow their career.
IBM’s product and technology landscape includes Research, Software, and Infrastructure. Entering this domain positions you at the heart of IBM, where growth and innovation thrive.
Databases and event streams are complementary infrastructure in modern software architecture. You will be highly involved in the design, implementation, and operation of Astra Streaming (Pulsar) and in our mission to enable the world’s leading enterprises as we scale up and deliver an amazing developer experience. You will also be responsible for helping us ensure high uptime and satisfied customers across our various production and non-production environments.
What you will do:
* Ensure production stability and high uptime, and assist in debugging and root-causing user-facing issues
* Contribute to open-source and proprietary projects that interface with Pulsar
* Perform software upgrades and configuration updates in a production environment
* Perform security analysis and apply changes to comply with security policies
* Maintain monitoring systems, and configure alerting and log collection
* Work in a fast-moving environment to rapidly prototype, iterate, and evolve solutions for real-world developer needs
* Perform regular code reviews among peers
What you will need:
* 4 - 6 years of relevant experience
* Systems-level proficiency in Java, Golang, or another popular language
* Experience working on and operating large-scale distributed production systems
* Kubernetes (EKS, AKS, GKE), Helm, and CRDs (Operators)
* Infrastructure as Code, CI/CD (ArgoCD), Jenkins, or similar
* Metrics, Alerting, and Logging: Grafana, Prometheus, Splunk
* Knowledge of services that achieve massive scalability and high availability
* Cloud Infrastructure Providers: GCP, Azure, AWS, or similar
* Experience in SDLC, having contributed at each step: Plan, Track, Code, Build, Test, Deploy, and Monitor
* Experience maintaining a production Apache Pulsar or Kafka cluster is a plus
* Experience with Prometheus and either Thanos or another metrics aggregation tool is a plus
* Experience with Terraform is a plus
* Experience with Apache Cassandra is a plus