Lead Site Reliability Engineer
Company
Masabi
Location
Colombia
Type
Full Time
Job Description
Introducing Masabi
// At Masabi, we’re driving the fare payment revolution, powering the journeys of millions all over the world. We build fare collection platforms that allow riders to seamlessly buy and present tickets for public transport, either on their mobile phones, from a ticket machine or even by tapping their bank card to travel.
Our Justride platform is used in over 250 locations globally, including some of the largest cities in the world. With our industry-first mobile ticketing SDK, we’ve partnered with large players in the transport space, including Uber, Moovit and Transit.
Your own journey is important to us too. Choosing a role here means joining a network of innovators from all walks of life; a group of passionate individuals who consistently deliver. Here you’ll find the tools you need to build the career you want. Whether you’re taking the direct route or trying a new path, we’ll support you no matter what.
The Role_
// We’re looking for a Lead Site Reliability Engineer to join our platform team: someone who’s confident working hands-on with infrastructure, but also ready to shape how we scale and operate as a global team.
You’ll take ownership of key systems, lead cross-functional work and help evolve the way we build for performance, reliability and security. This role is ideal for those who enjoy solving complex problems, improving systems through automation and supporting others as they grow. It’s a chance to have both technical depth and meaningful influence while staying close to the work that matters.
Location_
This is a remote role, open to candidates based in Colombia.
What You’ll Be Doing_
Build and automate reliable systems
- Lead design discussions and make key architectural decisions for reliability, scalability and performance
- Establish SRE standards and best practices (IaC patterns, CI/CD maturity, observability, etc.) across teams
- Design and manage infrastructure using Terraform and CloudFormation
- Build and evolve CI/CD pipelines that support fast, safe and frequent deployments
- Automate manual tasks to reduce operational load and enable faster delivery
- Help expand our infrastructure globally, scaling up new environments with care
Improve visibility, scale and performance
- Define and maintain SLIs, SLOs and alerting strategies aligned with user experience
- Implement monitoring solutions that give us clear, early signals during incidents
- Lead capacity planning and performance tuning as our systems and teams grow
- Identify opportunities to improve architecture for resilience and cost-effectiveness
Own reliability and incident response
- Lead or contribute to incident response, root cause analysis and post-incident reviews
- Design and maintain disaster recovery and failover strategies
- Partner with compliance and security teams to meet frameworks like SOC 2 and PCI
Support others and share your knowledge
- Collaborate with engineers, architects and product teams to embed SRE practices from the start and define long-term platform reliability strategy
- Mentor others in areas like observability, incident readiness and infrastructure-as-code
- Document systems and processes clearly to support learning and long-term success
- Take part in the on-call rotation, which is shared across the team and paid on top of salary
About You_
// You’re an experienced SRE who combines technical depth with curiosity, care and a desire to make things better for the platform, the team and the people using our systems.
- You’ve worked in SRE, platform or DevOps roles where reliability was business-critical (24/7)
- You have proven experience designing and evolving production-grade systems for scale and resilience
- You’re comfortable designing and operating in AWS, with strong knowledge of cloud architecture, networking and security (VPC design, IAM least privilege)
- You have hands-on experience with Terraform, infrastructure automation and CI/CD systems
- You’ve led or contributed to high-impact projects involving observability, performance, incident command and/or reliability (distributed tracing, log correlation, metrics maturity, etc.)
- You communicate clearly and drive cross-functional reliability improvements in distributed, async-first teams
- You enjoy helping others grow and value a kind, collaborative engineering culture
- You take pride in doing things the right way, but you’re pragmatic and focused on impact
Nice To Have_
- Familiarity with PCI DSS v4 or similar compliance standards
- Experience with container orchestration
- AWS certifications
Our Tech Stack_
// Our platform is JVM-based and cloud-native, running on AWS. The SRE team works across both modern infrastructure and legacy systems as we continue to scale globally.
We use a range of proven tools to support performance, reliability and speed of delivery:
- Monitoring & Observability: Grafana, Prometheus, CloudWatch, Pingdom, Kibana
- Infrastructure as Code: Terraform, CloudFormation
- CI/CD & Automation: GitLab CI, Rundeck
- Configuration Management & Logging: Puppet, Confluent Cloud
Some of our benefits_
- Competitive salary package
- 15 days of paid vacation each year, plus 18 public holidays
- Private healthcare
- Monthly team bonding allowance
- Menopause support
- Choice of a workstation
- Ability to work for up to 3 months per year from any country in the world
- Fun and collaborative environment with a focus on making a difference in the world
// In addition to the above, as an employee you will also have access to a training allowance of up to $750 USD and $250 USD to spend on your home office every year.
Careers at Masabi are for people going places - driven by a mission to make transit fair and accessible for all.
We are a network of innovators from all walks of life, passionate about making a difference. At Masabi, we operate with openness and trust, creating an environment where everyone feels empowered to bring their whole, authentic selves to work.
Whoever you are, just be yourself. We welcome applications from underrepresented backgrounds and encourage you to share your pronouns at any stage. Together, we simplify journeys, remove barriers and improve daily life for millions.
Why Join Masabi?
- Driven by Purpose – We believe in journeys made simple. The work isn’t always easy, but the best things never are.
- Encouraged to Accelerate – Masabi is going places, and our people are in the driving seat. Whether you’re taking the direct route or exploring new paths, we support your journey.
- Advancing with Empathy – We put people first and foster a culture of learning, not blame. No matter your cargo, we share the load.
We’re already powering journeys - are you ready to join us?
Date Posted
11/25/2025