Site Reliability Engineer / Team Lead

Omilia · Argentina

Company

Omilia

Location

Argentina

Type

Full Time

Job Description

We are looking for an experienced Site Reliability Engineer / Team Lead to manage and coordinate a team of reliability engineers. Your primary responsibility will be ensuring the high availability, security, and reliability of our cloud platform. You will oversee incident resolution, change management, and team coordination, as well as training and developing team members.

Requirements

Change, Incident and Problem Management:

  • Oversee the resolution of complex incidents.
  • Coordinate with SREs to ensure timely incident resolution.
  • Optimize technical processes to improve change quality.
  • Track and report on incident resolution metrics.
  • Ensure that SLOs & SLIs are defined and maintained.

Customer Escalation Handling:

  • Handle escalated incidents and ensure timely resolution.
  • Ensure customer satisfaction with the resolution process.

Team Coordination:

  • Assign tasks and tickets to team members.
  • Ensure proper documentation of incidents and resolutions.
  • Provide guidance and support to SREs.
  • Advise Platform Team Members plus other Stakeholders with Omilia Best Practices.

Quality Assurance:

  • Review and ensure the quality of incident responses and solutions.
  • Conduct regular audits of incident reports and resolutions.

Training and Development:

  • Develop training programs for new and existing team members.
  • Conduct knowledge-sharing sessions.

Platform Security, High Availability and Reliability:

  • Drive the design and development of the SRE infrastructure(Dev, Staging, PreProd Environments included as well) and maintenance tools for the full lifecycle of system development.
  • Disaster Recovery, Backup Strategy for data integrity and business continuity.
  • Provisioning, configuration, and scaling for efficiency and consistency.
  • Continuous Improvements of Omilia Cloud Platform.

Experience Required:

  • 5-7 years of experience in SRE or related roles.
  • Experience in large-scale system architecture and automation.
  • Experience in a leadership role is an advantage.
  • Bachelor’s degree in Computer Science, Engineering, or related field.

Must-have:

  • AWS
  • Azure (a plus)
  • Kubernetes
  • Docker
  • Terraform
  • Ansible

Nice-to-have:

  • Python
  • Git
  • Shell scripting
  • Linux
  • SQL

Benefits

  • Fixed compensation;
  • Long-term employment with the working days vacation;
  • Development in professional growth (courses, training, etc);
  • Being part of successful cutting-edge technology products that are making a global impact in the service industry;
  • Proficient and fun-to-work-with colleagues;
  • Apple gear.

Omilia is proud to be an equal opportunity employer and is dedicated to fostering a diverse and inclusive workplace. We believe that embracing diversity in all its forms enriches our workplace and drives our collective success. We are committed to creating an environment where everyone feels welcomed, valued, and empowered to contribute their unique perspectives without regard to factors such as race, color, religion, gender, gender identity or expression, sexual orientation, national origin, heredity, disability, age, or veteran status, all eligible candidates will be given consideration for employment.

Apply Now

Date Posted

12/24/2024

Views

0

Back to Job Listings Add To Job List Company Profile View Company Reviews
Positive
Subjectivity Score: 0.9

Similar Jobs

Principal Software Engineer - Webflow

Views in the last 30 days - 0

Webflow is seeking a Principal Engineer in Argentina to help establish an international engineering hub emphasizing impact on technical foundation cul...

View Details

Senior Database Engineer - Webflow

Views in the last 30 days - 0

Webflow seeks a Senior Database Engineer to join their remotefirst team emphasizing database optimization collaboration with product engineers and con...

View Details

QA Automation Engineer - Masabi

Views in the last 30 days - 0

Masabi promotes their mission to revolutionize fare payment systems globally highlighting innovative platforms and job opportunities for a Backend Tes...

View Details

Lead Fraud Operations Analyst - Apollo.io

Views in the last 30 days - 0

This job description outlines a Sr Fraud Operations Analyst role requiring expertise in fraud investigations SQL and crossteam collaboration The posit...

View Details

Principal Software Engineer - Webflow

Views in the last 30 days - 0

Webflow is seeking a Principal Engineer in Argentina to help establish an international engineering hub emphasizing innovation scalability and inclusi...

View Details

Senior Analytics Engineer - Trafilea

Views in the last 30 days - 0

Trafilea is a consumer tech platform with 1B revenue and 12M customers seeking a Senior Analytics Engineer The role involves building data models work...

View Details