Cloud Platform Developer
Company
IBM
Location
Krakow, Poland
Type
Full Time
Job Description
Introduction
Watson Orders is a Silicon Valley based technology development group within IBM targeting the development of world-class conversational AI. Our mission is to deliver advanced technology solutions that address real-world, data driven needs in customer-facing the quick service restaurant, environment. We are focused on using state-of-the-art Machine Learning, AI, and related technologies to completely transform the customer experience
Focus on the role, not on IBM or business unit. Candidates can learn about the company from places other than the Job Description; tell them about the role and WHY they should want it.
Your Role and Responsibilities
We are currently looking for skilled Infrastructure Engineers to develop, maintain, and support
Want more jobs like this?
Get Software Engineering jobs in Krakow, Poland delivered to your inbox every week.

container orchestration (kubernetes), distributed ML workloads, network services, storage
layers, and petabyte scale AWS storage and Kafka stream stack.
Responsibilities:
- Develop and maintain scalable distributed systems in AWS
- Develop and maintain high performance k8s clusters across multiple regions
- Develop and maintain telemetry infrastructure & service instrumentation (python) for metrics, distributed tracing, and logging
- Support infrastructure for a petabyte scale data platform and stream analysis services
- Work with Audio and Speech AI Engineers to accelerate development and deployment of heterogeneous analysis and distributed training pipelines
- Participate in the definition and management of SLIs, SLOs and error budgets for infrastructure and production services
- Design and implement infrastructure-as-code pipelines
Required Technical and Professional Expertise
- AWS experience designing, implementing, and support cloud-based infrastructure
- Experience architecting, deploying, and supporting kubernetes in cloud environments
- Experience designing and supporting distributed systems
- Experience writing production code in one of more languages such as Python (preferred), Java, Go in a microservices environments
- Linux experience configuring, supporting, and optimizing
Preferred Technical and Professional Expertise
- Familiarity running distributed ML workloads in cluster orchestrated environments
- Experience building and supporting telemetry and related infrastructure (Open telemetry, Jaeger, Grafana, Prometheus)
- Experience designing and implementing infrastructure as code pipelines
- PubSub Experience (Kafka, SQS, SNS, MQTT)
- Experience designing and implementing traffic routing strategies in edge and microservices environments.
Date Posted
10/08/2024
Views
0
Similar Jobs
Technical Lead Manager, Android Kernel, Android Systems - Google
Views in the last 30 days - 0
View DetailsTechnical Program Manager, Android for Automotive - Google
Views in the last 30 days - 0
View Details