IBM Infrastructure is a catalyst that makes the world work better because our clients demand it. Heterogeneous environments the explosion of data digital automation and cybersecurity threats require hybrid cloud infrastructure that only IBM can provide.
Your ability to be creative a forward-thinker and to focus on innovation that matters is all support by our growth minded culture as we continue to drive career development across our teams. Collaboration is key to IBM Infrastructure success as we bring together different business units and teams that balance their priorities in a way that best serves our client's needs.
IBM's product and technology landscape includes Research Software and Infrastructure. Entering this domain positions you at the heart of IBM where growth and innovation thrive.
We are looking for a passionate and skilled DevOps Engineer to join our AI &
Microservices team. You will work closely with backend engineers and ML
practitioners to design automate and optimize infrastructure for
high-performance APIs and AI workloads. This role demands a strong foundation
in cloud-native DevOps practices container orchestration and modern
deployment strategies.
5+ year of experience in Design and maintain CI/CD pipelines for Python/Golang-based microservices.
Automate infrastructure provisioning using tools like Terraform or Helm.
Manage the infra using Openshift and KVM technologies
Manage containerized workloads using Docker and Kubernetes.
Support APIs using HTTP gRPC and WebSocket protocols.
Collaborate with developers working on Retrieval-Augmented Generation (RAG) LangChain and other AI frameworks.
Monitor and optimize performance of LLM-based services.
Implement observability tools for logging metrics and tracing.
Ensure high availability scalability and security of production systems.
Containers & Orchestration: Proficiency in Openshift Docker and Kubernetes.
Programming & API Knowledge: Familiarity with Python/Golang APIs and microservices architecture.
AI Frameworks: Understanding of LangChain RAG and LLMs (Large Language Models).
Protocols: Experience with HTTP gRPC WebSocket.
CI/CD & Automation: Hands-on with Jenkins GitHub Actions GitLab CI or similar.
Infrastructure as Code: Terraform Helm or Ansible.
Monitoring & Logging: Prometheus Grafana ELK Stack or similar tools.
Cloud Platforms: AWS GCP or Azure.