Staff Site Reliability Engineer

Company

neptune.ai

Location

Europe

Type

Full Time

Job Description

We are seeking an experienced Staff Site Reliability Engineer to join our fully remote team. As a key player in our Engineering team, you will contribute to infrastructure design and optimization and have an impact on the scalability, resilience, and performance of Neptune solutions. This role demands a deep understanding of distributed systems and performance optimization, and the ability to drive significant business value through technical solutions.

Our tech stack (the bigger the overlap, the better):

  • Languages: Rust, JVM (Java, Spring, Scala, Kotlin), Python.

  • Data: ClickHouse, Kafka, Elasticsearch, Redis, MySQL.

  • Cloud platforms: Microsoft Azure, Google Cloud Platform (GCP).

  • DevOps tools: Kubernetes, Terraform, Helm.

  • Others: Protobufs, gRPC, Swagger.

Responsibilities:

  • Ownership of Site Reliability Process: Own the site reliability process and systems through all stages, from design and implementation to deployment and continuous maintenance.

  • Infrastructure Optimization: Ensure the scalability, resilience, and performance of Neptune solutions across global SaaS and client-hosted environments, including platforms such as GCP, Azure, AWS, and on-premise systems.

  • Automation Strategy Development: Design and implement automation workflows to streamline deployments, upgrades, and incident response, reducing manual tasks and enhancing operational efficiency and consistency.

  • Security and Compliance: Ensure infrastructure and processes meet security and industry standards, protecting sensitive data.

  • Cross-Functional Collaboration: Partner with development, product, customer success, and client teams to align on requirements and deliver robust, scalable, and reliable solutions.

  • Documentation and Knowledge Sharing: Document architecture, operational procedures, and troubleshooting guides to enable knowledge sharing, repeatability, and continuous improvement.

  • Incident Management: Participate in on-call rotations, effectively addressing and resolving production incidents to maintain system uptime and performance.

You might be a fit if you have:

  • 6+ years in SRE, DevOps, or related roles.

  • Strong experience managing and optimizing Kubernetes clusters for robust, scalable, and efficient infrastructure.

  • Proven expertise in designing and implementing automation solutions for infrastructure and application deployment, with experience in Terraform, Helm, and GitLab CI/CD.

  • Strong programming skills in Shell and Python.

  • Extensive experience with Linux system administration and network management.

  • Expertise in managing distributed computing systems and near real-time data streaming platforms.

  • Fluency in English, with solid communication skills for interacting with global customers.

Nice to have:

  • Experience in security best practices, compliance standards (e.g. SOC 2), and infrastructure hardening.

  • Experience with multi-cloud architecture and cloud-native technologies.

  • Experience in high-traffic, petabyte-scale data environments.

  • Experience with ClickHouse and Kafka deployments.

We offer:

  • Flexibility: 100% remote work and flexible working hours, with offices (co-works) available in Warsaw, Wrocław, Poznań, and Kraków;

  • Share in our success: Participate in the Employee Stock Option Plan and be part of our growth journey;

  • Time off: 20 paid service-free days per year;

  • Ownership and impact: Space to take action, bring your ideas to life, and make a real impact.

Any questions?

Check out our ultimate guide for candidates to the neptune.ai Engineering team.

Don’t hesitate to contact our Talent Acquisition team and check out our About us page to get to know the story and faces behind Neptune.

By applying, you consent for neptune.ai to process your personal data to assess your suitability for the role you have applied for, in accordance with the General Data Protection Regulation (GDPR). Your personal data will remain confidential and will be shared only with authorized personnel involved in the recruitment process. You have the right to access, rectify, or delete your personal data at any time. With your optional consent, we can retain your data for up to 12 months after the application to consider you for future suitable roles if you’re not a match for the current position.

Date Posted

11/21/2024
