Senior Customer Reliability Engineer

Astronomer · USA

Company

Astronomer

Location

USA

Type

Full Time

Job Description

Astronomer designed Astro, a modern data orchestration platform, powered by Apache Airflow™. Astro enables companies to place Apache Airflow at the core of their data operations, providing ease of use, scalability, and enterprise-grade security, to ensure the reliable delivery of mission-critical data pipelines.

We’re a globally-distributed and rapidly growing venture-backed team of learners, innovators and collaborators. Our mission is to build an Enterprise-grade product that makes it easy for data teams at Fortune 500’s and startups alike to adopt Apache Airflow. As a member of our team, you will be at the forefront of the industry as we strive to deliver the world's data.

Your background may be unconventional; as long as you have the essential qualifications, we encourage you to apply. While having "bonus" qualifications makes for a strong candidate, Astronomer values diverse experiences. Many of us at Astronomer haven't followed traditional career paths, and we welcome it if yours hasn't either.

About this role

The Astronomer Customer Reliability Engineering (CRE) team is responsible for the success of our customers' usage of our managed Airflow service.

The CREs are responsible for operating, monitoring, and maintaining the platform to ensure availability, predictability, and reliable operations.

As an infrastructure specialist within the team, you will focus on the reliability of the underlying cloud infrastructure and Kubernetes clusters. This entails responding to incidents either raised by a customer, or from our monitoring system and then taking further steps to ensure problems are permanently resolved or monitored. As owners of the observability platform, CRE has unlimited potential to improve the reliability of the product and deliver the best possible outcome for our customers.

This role is directly customer-facing and gives exposure to very diverse problems and requirements. CRE get the opportunity to interface with customers from a variety of industries across different cloud providers, and all with different expectations. Your contributions will directly impact customers' success with using the Astronomer products, and you will be able to help make meaningful improvements to the customer experience.

This position includes a requirement to work from 9AM to 3PM EST, Monday to Friday. Your remaining work time is flexible.

What you get to do:
  • Provide solutions to customers to make them successful using our products.

  • Troubleshoot customer environments and engage in active triaging with customers

  • Participate in on-call rotation for weekend coverage

  • Provide feedback to the product development teams on customer needs and pain points.

  • Build out our monitoring and alerting systems.

  • Build and maintain automation to ensure daily operational tasks are handled as efficiently as possible. 

  • Help direct the architecture of the products and contribute where possible.

  • Own the customer experience, working directly with customers to prioritize and solve issues, meet SLAs, and provide “white glove” guidance on the path to production.

  • Participate remotely within a fully distributed team.

  • Enhance and enrich customer documentation

  • Work with the latest technology and multi-cloud implementations

What you bring to the role:
  • 6 years of experience, preferably with large, complex cloud infrastructures operating at scale

  • 4 years of experience with Kubernetes

  • Experience managing a Production distributed system with at least one major cloud provider (one or all: AWS, GCP, Azure)

  • Strong Linux experience

  • Knowledge of how to operate and monitor issues for distributed systems 

  • Previous experience in handling customers issues (internal or external) 

  • Strong communication skills

  • DevOps or CI/CD experience

  • Python scripting

  • Good troubleshooting Skills 

Bonus points if you have:
  • Experience as a Site Reliability Engineer

  • Worked with Kubernetes Custom Resources

  • Depth of knowledge with Azure

  • Airflow/Big Data Orchestration experience

  • IaC experience

The estimated salary for this role ranges from $165,000-185,000, along with an equity component. This range is merely an estimate, and the width of the range reflects willingness to consider candidates with broad prior seniority. Actual compensation may deviate from this range based on skills, experience, and qualifications.

#LI-Remote

At Astronomer, we value diversity. We are an equal opportunity employer: we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.  Astronomer is a remote-first company.

Apply Now

Date Posted

10/04/2024

Views

0

Back to Job Listings Add To Job List Company Profile View Company Reviews
Positive
Subjectivity Score: 0.8

Similar Jobs

Staff Salesforce Engineer - CRM Systems - GitLab

Views in the last 30 days - 0

This job description outlines a Staff Salesforce Developer role focusing on designing building and scaling enterprisegrade solutions across Salesforce...

View Details

Software Engineer III | Platform - ExtraHop

Views in the last 30 days - 0

This job posting seeks a Software Engineer III to develop features lead junior team members and contribute to secure cloud and appliance solutions The...

View Details

DevOps Engineer - Guidehouse

Views in the last 30 days - 0

This job posting seeks a skilled DevOps Engineer to support development QA and operations across applications emphasizing automation cloudnative infra...

View Details

Senior Marketer - Usage Automation - HubSpot

Views in the last 30 days - 0

This job posting outlines a Senior Marketer role at HubSpot focusing on customer experience through datadriven automation Responsibilities include des...

View Details

Engineering Manager - Software Supply Chain Security: Auth Infrastructure - GitLab

Views in the last 30 days - 0

This job description highlights a leadership role in developing secure scalable authentication infrastructure for GitLab It emphasizes technical exper...

View Details

Growth Product Lead - Loyalty - Trafilea

Views in the last 30 days - 0

Trafilea promotes itself as a transformative consumer tech platform with AIdriven growth solutions highlighting achievements like 1B revenue and globa...

View Details