Job Description
Roadie, a UPS Company, is a logistics management and crowdsourced delivery platform. Founded in 2014, Roadie offers businesses fast, flexible, and asset-light logistics solutions for last-mile delivery. Roadie enables local delivery to more than 95% of U.S. households by providing access to more than 200,000 independent drivers nationwide, allowing businesses to offer their customers delivery optionality for almost any industry, from airlines to artisans.
Roadie is seeking a Site Reliability Engineer to join our growing Technical Operations Team. We are looking for a candidate who has experience implementing site reliability principles as well as production-level Kubernetes experience. The ideal candidate is a skilled problem solver with intimate knowledge of site reliability practices, standard DevOps principles, AWS, scripting languages, and Kubernetes.
What You'll Do
- Maintain, support, and engineer production and non-production Kubernetes clusters
- Deploy and maintain monitoring and logging solutions based on Prometheus, Thanos, and Loki
- Work directly with development teams to foster site reliability principles
- Define and manage SLOs, SLIs, and error budgets
- Build automation and tooling to “eliminate toil”
- Capacity planning and cost optimization
- Debug production/non-production issues
- Take part in 24/7 on-call rotation
Technology We're Using Now
- Python, Ruby on Rails, Golang
- Postgres, Redshift, Redis, Kafka
- AWS, GCP
- Docker/Kubernetes
- Prometheus/Thanos/Loki/Grafana
- Istio, Karpenter, KEDA
- Git/CircleCI
- ArgoCD
- Terraform/Crossplane
What You Bring
- 2+ years in various SRE roles
- 4+ years in various DevOps/Systems Engineering roles
- 2+ years of experience building and managing production Kubernetes infrastructure, with emphasis on the use of *nix and cloud vendor Kubernetes solutions (EKS, GKE)
- 4+ years of experience with popular scripting languages (Python, Ruby, Bash)
- Experience with infrastructure as code, such as Terraform
- Experience with CI/CD development tools (CircleCI)
- Experience with GitOps tools (ArgoCD)
- Experience using a broad range of AWS technologies (RDS, Elasticsearch, VPC, EKS, S3, CloudFront, MSK, ElastiCache, CloudWatch)
- Must be able to work independently, be self-motivated, and handle multiple priorities
- Comfortable working in a fast-paced agile environment
- Finally, a willingness to admit what you don’t know and learn what you need to learn quickly
Why Roadie?
- Competitive compensation packages
- 100% covered health insurance premiums for yourself
- 401(k) with company match
- Tuition and student loan repayment assistance (that’s right - Roadie will contribute directly to your existing student loans!)
- Flexible work schedule with unlimited PTO
- Monthly 3-day weekends
- Monthly WFH stipend
- Paid sabbatical leave: tenured team members are given time to rest, relax, and explore
- The technology you need to get the job done
Date Posted
04/15/2024