Site Reliability Engineer

SportyBet β€’ Worldwide

Company

SportyBet

Location

Worldwide

Type

Full Time

Job Description

Sporty's sites are some of the most popular on the internet, consistently staying in Alexa's list of top websites for the countries they operate in


In addition to our DevOps Team we are building a Site Reliability Team whose purpose is to focus on site reliability and security. It will also involved deployment, configuration, and monitoring, as well as the availability, latency, change management, emergency response, and capacity management of services in production.


Our Stack


  • Backend Application Framework: Spring Boot (Java Config + Embedded Tomcat)
  • Frontend Application Framework: VueJS
  • Micro Service Framework: Spring Cloud Dalston (Netflix Eureka + Netflix Eureka + Netflix Ribbon + Feign)
  • Database: AWS RDS, RDS Proxy, MONGODB
  • Public Cache: AWS ElastiCache + Redis
  • Message Queue: Apache RocketMQ, RabbitMQ
  • Distributed Scheduling: Dangdang Elastic Job
  • Data Index and Search: ElasticSearch
  • Log Real-time Visualization: ElasticSearch + Logstash + Kibana, Grafana Loki
  • Business Monitoring: Prometheus + Grafana
  • Reverse Proxy: Nginx
  • CDN: Cloudflare
  • Server Virtualization Container: AWS EKS + AWS EC2
  • Server Operation System: CentOS
  • Static File Storage: AWS S3
  • Inner DNS Resolution: AWS Route 53
  • Network Management: AWS VPC
  • Cluster Management and Scaling: AWS OpsWorks
  • Cluster Monitoring: Prometheus + AWS CloudWatch
  • HTTPS Certificate Management: AWS Certificate Manager
  • Malicious Attack Defending: AWS WAF & Shield
  • Cluster Alert: AWS SNS + Slack
  • Continuous Integration/Deployment: Jenkins, Rancher, ArgoCD
  • Configuration Tool: Ansible, Chef, Salt


Responsibilities


  • Work with a team of DevOps/SRE and DBA professionals
  • Improve existing infrastructure and processes in the 6 countries we’re currently deployed in as well as streamlining processes deploy to new countries in the future
  • Holistically improve all aspects of our current infrastructure including: reducing costs; streamlining environment provisioning; lowering response times and incorporating the latest techniques and technologies
  • Monitor and maintain the existing cloud infrastructure via autoscaling, automated alerts, andOpsWork and Grafana dashboards
  • Take ownership and responsibility for our cloud operation activities
  • Liaise with external security agencies for annual audits as well as perform our own internal security sweeps
  • Aid in reconfiguring existing architecture to allow for rapid deployments to new countries
  • Mentoring less experienced team members


Requirements


  • 3+ years SRE experience
  • Experience independently leading the planning and deployment of a project
  • Experienced with cloud platforms, especially AWS, including solid knowledge of how to utilize cloud resources to fulfill the demand from other teams and production
  • A sound understanding of modern Micro Services and Service Mesh concepts
  • Experience managing Kubernetes, including CI / CD with Kubernetes
  • Solid networking knowledge, especially the TCP / IP stack and HTTP protocol
  • A strong understanding of cache, including CDN, HTTP cache, Redis / Memcached
  • Excellent troubleshooting skills, including Linux OS issue diagnosis and OS parameter optimization, JVM optimization would be highly advantageous
  • Experienced with CloudNative Monitoring solution in Large distributed system using observation model


Benefits


Quarterly and flash bonuses

Flexible working hours

Top-of-the-line equipment

Education allowance

Referral bonuses

28 days paid annual leave

Annual company retreat - we all went to Dubai in 2022 and are planning 2 more retreats for 2023!

Highly talented, dependable co-workers in a global, multicultural organisation

Payment via DEEL, a world class online wallet systemΒ 

We score 100% on The Joel Test

Our teams are small enough for you to be impactful

Our business is globally established and successful, offering stability and security to our Team Members

Apply Now

Date Posted

01/27/2023

Views

5

Back to Job Listings ❀️Add To Job List Company Info View Company Reviews
Positive
Subjectivity Score: 0.8

Similar Jobs

Software Engineer C++ (Senior) - Apexver

Views in the last 30 days - 0

The role of a Senior Software Engineer at Apexver involves leading the design development and scaling of highperformance trading systems The position ...

View Details

Software Engineer, iOS Core Product - Speechify, Inc.

Views in the last 30 days - 0

Speechify is a texttospeech product that has gained significant traction with over 50 million users worldwide The company has recently been recognized...

View Details

The SafetyWing Digital Nomad Residency - SafetyWing

Views in the last 30 days - 0

SafetyWing offers a digital nomad residency program with up to 4000 reimbursement for travel accommodation and work tools emphasizing mentorship commu...

View Details

AI Trainer - Anuttacon

Views in the last 30 days - 0

The text describes a companys culture emphasizing creativity collaboration and impactful work It outlines a mission to create immersive virtual worlds...

View Details

Executive Assistant & Accountability Partner (Full‑Time, Remote, ET Hours) - N/A

Views in the last 30 days - 0

This job description outlines a remote Executive Assistant role requiring calendar management travel coordination family operations oversight and acco...

View Details

Inside Sales Contractor - Credit Wellness, LLC

Views in the last 30 days - 0

This job posting promotes a remote financial services sales role with competitive commissionbased compensation guaranteed training stipends and growth...

View Details