Staff Site Reliability Engineer (Platform Reliability)

Jobgether · Spain

Company

Jobgether

Location

Spain

Type

Full Time

Job Description

Team: IT

This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Staff Site Reliability Engineer (Platform Reliability) in Spain.

This role offers the opportunity to be a key technical leader within a Platform Reliability team, helping to scale and maintain a highly reliable infrastructure across Europe. You will shape the evolution of cloud-native systems, define architectural solutions, and drive projects end-to-end with autonomy. Acting as a mentor and technical reference, you will support junior engineers while collaborating closely with backend, data, security, and engineering efficiency teams. The position combines hands-on coding, automation, and infrastructure design, ensuring high availability and observability for a growing platform. You will influence strategic decisions, contribute to cross-team problem-solving, and help embed reliability as a core principle across the organization.

Accountabilities:

  • Lead and contribute to complex infrastructure projects, framing problems and delivering end-to-end solutions
  • Design, deploy, and maintain scalable cloud-native infrastructure using Kubernetes, AWS, and other modern tooling
  • Write production-grade code, tools, and APIs to improve platform reliability and automation
  • Automate repetitive tasks to reduce operational toil and improve efficiency
  • Ensure observability and monitoring across services, maintaining logs, metrics, and traces for visibility and debugging
  • Participate in on-call rotations, lead post-incident reviews, and implement lasting fixes
  • Mentor team members, share knowledge, and promote a culture of reliability and technical excellence
  • Requirements:

    • Extensive hands-on experience with cloud-native infrastructure in production environments
    • Strong experience managing Kubernetes clusters at scale and working with containerized workloads
    • Proficiency in Go and/or Python for building services, tools, and automation
    • Knowledge of CI/CD pipelines, GitOps, and infrastructure as code with Terraform
    • Familiarity with monitoring, logging, and tracing tools such as Prometheus, Thanos, OpenTelemetry, Elasticsearch, and Loki
    • Strong problem-solving skills with the ability to understand complex system dependencies and trade-offs
    • Experience in AI-assisted engineering practices is a plus
    • Excellent collaboration and mentoring skills, with a focus on knowledge sharing and team growth
    • Benefits:

      • Competitive compensation and performance-based rewards
      • Fully remote working flexibility from Spain
      • Autonomy to drive projects and influence platform strategy
      • Modern engineering practices, including AI-assisted development
      • Professional growth opportunities and knowledge-sharing culture
      • Access to state-of-the-art cloud infrastructure and tooling
      • Support for wellbeing and work-life balance within a high-performing team
Apply Now

Date Posted

04/09/2026

Views

0

Back to Job Listings Add To Job List Company Profile View Company Reviews
Neutral
Subjectivity Score: 0

© 2026 Job Transparency. All rights reserved.