Staff Site Reliability Engineer (Platform Reliability)
Job Description
Team: IT
This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Staff Site Reliability Engineer (Platform Reliability) in Spain.
This role offers the opportunity to be a key technical leader within a Platform Reliability team, helping to scale and maintain a highly reliable infrastructure across Europe. You will shape the evolution of cloud-native systems, define architectural solutions, and drive projects end-to-end with autonomy. Acting as a mentor and technical reference, you will support junior engineers while collaborating closely with backend, data, security, and engineering efficiency teams. The position combines hands-on coding, automation, and infrastructure design, ensuring high availability and observability for a growing platform. You will influence strategic decisions, contribute to cross-team problem-solving, and help embed reliability as a core principle across the organization.
Accountabilities:
- Lead and contribute to complex infrastructure projects, framing problems and delivering end-to-end solutions
- Design, deploy, and maintain scalable cloud-native infrastructure using Kubernetes, AWS, and other modern tooling
- Write production-grade code, tools, and APIs to improve platform reliability and automation
- Automate repetitive tasks to reduce operational toil and improve efficiency
- Ensure observability and monitoring across services, maintaining logs, metrics, and traces for visibility and debugging
- Participate in on-call rotations, lead post-incident reviews, and implement lasting fixes
- Mentor team members, share knowledge, and promote a culture of reliability and technical excellence
Requirements:
- Extensive hands-on experience with cloud-native infrastructure in production environments
- Strong experience managing Kubernetes clusters at scale and working with containerized workloads
- Proficiency in Go and/or Python for building services, tools, and automation
- Knowledge of CI/CD pipelines, GitOps, and infrastructure as code with Terraform
- Familiarity with monitoring, logging, and tracing tools such as Prometheus, Thanos, OpenTelemetry, Elasticsearch, and Loki
- Strong problem-solving skills with the ability to understand complex system dependencies and trade-offs
- Experience in AI-assisted engineering practices is a plus
- Excellent collaboration and mentoring skills, with a focus on knowledge sharing and team growth
Benefits:
- Competitive compensation and performance-based rewards
- Fully remote working flexibility from Spain
- Autonomy to drive projects and influence platform strategy
- Modern engineering practices, including AI-assisted development
- Professional growth opportunities and knowledge-sharing culture
- Access to state-of-the-art cloud infrastructure and tooling
- Support for wellbeing and work-life balance within a high-performing team
Date Posted
04/09/2026