Platform Engineer - (Site Reliability Engineering)

Jobgether · Brazil

Company

Jobgether

Location

Brazil

Type

Full Time

Job Description

Team: IT

This position is listed on behalf of a partner company, which manages all applications and next steps. Our partner is looking for a Platform Engineer - (Site Reliability Engineering) based in Brazil.

This is a high-impact SRE-focused platform engineering role centered on reliability, automation, and incident excellence within a large-scale, high-availability technology environment. You will own the full incident lifecycle, from real-time response and stakeholder coordination to deep post-incident analysis and long-term systemic fixes. The role is designed for an engineer who thrives under pressure, enjoys solving complex infrastructure problems, and is passionate about eliminating operational toil through automation and tooling. You will work closely with engineering squads across the organization to strengthen observability, improve platform resilience, and ensure production systems remain stable at scale. This is a highly collaborative and fast-paced environment where reliability is treated as a core product capability, not just a support function.

Accountabilities:

  • Own and drive end-to-end incident management processes, ensuring rapid response, clear communication, and effective resolution during production incidents

  • Lead on-call operations, including incident triage, escalation, coordination, and stakeholder communication across severity levels
  • Design and implement automation to improve postmortem workflows, including tracking action items, ownership, and remediation follow-ups
  • Build tooling and AI-assisted workflows to reduce operational toil and accelerate incident detection, response, and resolution
  • Improve observability systems, including dashboards, alerting strategies, and monitoring signals across distributed systems
  • Conduct post-incident analysis to identify root causes and implement long-term reliability improvements
  • Collaborate with engineering teams to define preventive measures, improve runbooks, and reduce recurring incidents
  • Support change and deployment processes with a strong focus on risk mitigation and system stability

Requirements:

This role requires strong technical depth in software engineering, cloud infrastructure, and incident-driven environments, along with the ability to remain effective under pressure.

  • Proven experience in Site Reliability Engineering, Platform Engineering, DevOps, or similar infrastructure-focused roles
  • Hands-on experience with Kubernetes, including deployment, debugging, and production troubleshooting
  • Strong understanding of CI/CD pipelines and modern DevOps practices
  • Software development experience in any modern language (Python or Java strongly preferred)
  • Strong automation mindset with a focus on reducing repetitive operational work through tooling
  • Experience with observability tools, monitoring systems, and alerting frameworks
  • Familiarity with AI/LLM-based workflows or agentic automation is highly desirable
  • Ability to manage high-severity incidents and communicate clearly with technical and non-technical stakeholders
  • Strong written and verbal communication skills in English
  • Self-driven, proactive mindset with the ability to operate independently in ambiguous situations
  • Experience in fintech or crypto environments is a plus

Benefits:

  • Remote-first work environment
  • Unlimited paid time off through a flexible time-off policy
  • Employee stock option program
  • Premium health, dental, and life insurance coverage (varies by country)
  • Extended family leave policies for all parents
  • Zero trading fees via internal crypto platform access
  • Strong focus on learning, autonomy, and professional growth
  • Opportunity to work on high-scale systems in a leading crypto infrastructure environment

Date Posted

06/25/2026

