Senior Site Reliability Engineer
Job Description
Roadie is seeking a Site Reliability Engineer to join our growing Technical Operations Team. We are looking for a candidate who has experience implementing site reliability principals, as well as production level Kubernetes experience. Â The ideal candidate is a skilled problem solver with intimate knowledge of site reliability practices, standard dev ops principles, AWS, scripting languages and Kubernetes.Â
What You'll Be Doing
- Maintain, support, and engineer production and non-production Kubernetes Clusters.
- Deploy and maintain monitoring and logging solutions based on Prometheus, Thanos and Loki.
- Work directly with Development teams to foster site reliability principals.
- Define and manage SLO, SLI and error budgets
- Build automation and tooling to “eliminate toil”
- Capacity planning, and cost optimization.
- Debug production / non-production issues.
- Take part in 24/7 on-call rotation.
Technology We're Using Now
- Python, Ruby on Rails, Golang
- React/Redux, Objective-C and Swift, Android
- Postgres, Redshift, Redis, Kafka
- AWS
- Docker/Kubernetes
- Prometheus/Thanos/Loki/Grafana
- Git/CircleCI
- ArgoCD
What You Bring
- 4+ Years in various SRE roles
- 6+ Years in various DevOPS/System Engineering roles
- 4+ Years of experience building and managing production Kubernetes infrastructure with emphasis on the use of *nix and cloud vendor Kubernetes solutions (EKS, GKS, etc.)
- 6+ Years experience with popular scripting languages (Python, Ruby, Bash, etc.)
- Experience with Automation/Config management tools such as ansible/chef/puppet
- Experience with CI/CD Development tools (CircleCI, Jenkins, etc.)
- Experience with GitOPS Tools (Argocd / Weaveworks, etc)
- Experience using a broad range of AWS technologies (VPC, EKS, S3, CloudFront, MSK, Elasticache, CloudWatch, etc.)
- Must be able to work independently, be self-motivated and handle multiple priorities
- Comfortable working in a fast-paced agile environment
Finally, a willingness to admit what you don’t know, and learn what you need to learn quickly
Why Roadie?Â
- Competitive compensation packagesÂ
- 100% covered health insurance premiums for yourself
- 401k with company match
- Tuition and student loan repayment assistance (that’s right - Roadie will contribute directly to your existing student loans!)Â
- Flexible work schedule with unlimited PTOÂ
- Monthly 3-day weekends
- Monthly WFH stipendÂ
- The technology you need to get the job done
Date Posted
09/09/2022
Views
5
Similar Jobs
Senior Software Engineer (Java) - NCR Corporation
Views in the last 30 days - 6
NCR Corporation is a leading software and servicesled enterprise provider in the financial retail and hospitality industries They are looking for a Se...
View DetailsAPI Software Development Engineer - II - Synchrony
Views in the last 30 days - 6
The job description is for an API Software Development Engineer II at Synchrony The role involves working on microservice APIs participating in hackat...
View DetailsAPI Software Development Engineer - I - Synchrony
Views in the last 30 days - 5
The job description is for an API Software Development Engineer I position at Synchrony The role involves working on microservice APIs participating ...
View DetailsSenior Software Engineering Manager - NCR Corporation
Views in the last 30 days - 5
NCR Corporation is a leader in transforming connecting and running technology platforms for selfdirected banking stores and restaurants They are looki...
View DetailsSr. Data Analyst/Engineer - Remote - Sharecare
Views in the last 30 days - 11
Sharecare is a digital health company that helps people manage their health They are seeking a Sr Data AnalystEngineer to contribute to a new platform...
View DetailsSenior Product Manager - Client - CharterUP
Views in the last 30 days - 6
CharterUP is a leading charter bus platform aiming to disrupt the massive and fragmented bus industry by using proprietary technology to connect bus c...
View Details