Senior Software Engineer - Compute

Reddit · USA

Company: Reddit

Location: USA

Type: Full Time

Job Description

The Compute team is looking to hire a Senior Software Engineer who thrives at the intersection of infrastructure and software development. This team’s challenges break into two domains, which we call platform engineering and cluster engineering.

Platform Engineering: Higher-level orchestration of both compute capacity and workload primitives to support our multi-cloud, multi-region deployments. Current focuses include:

  • Software automation that creates, manages, and destroys clusters in our fleet.

  • APIs and controllers that support multi-cluster deployment and scheduling mechanics.

  • Core SDKs that enable controller development in the larger organization.

  • Software that codifies out-of-cluster ancillary concerns, such as network configurations and managed services.

Cluster Engineering: Intra-cluster engineering problems that involve balancing performance, efficiency, and stability. Current focuses include:

  • Detection of node-level performance characteristics and making availability decisions based on the data.

  • Schedulers that support more efficient packing of resources along with reactive rescheduling on the basis of changing compute availability.

  • Kubernetes controllers that offer APIs in the cluster and perform reconciliation to reach a desired state.

  • Cluster upgrades, both mechanical process concerns and automation.

As a member of the Compute team, your work will span these two domains, which are rich with challenging infrastructure and software engineering problems. Your work will directly impact hundreds of millions of users around the world. Join us and help build the future of Reddit!

In your day-to-day, you can expect to:

  • Work collaboratively with a team of software engineers to create and maintain the foundational platform for running Reddit’s infrastructure.

  • Deliver software to improve the availability, scalability, latency, and efficiency of Reddit’s Compute Platform.

  • Contribute feedback to the technical and strategic direction of the compute platform.

  • Automate critical aspects of the development process, such as service creation and management, as well as critical infrastructure operations.

  • Share on-call responsibilities with the Compute team.

You have:

  • 4+ years of experience developing internet-scale software, preferably in the context of infrastructure.

    • Language proficiency in Go.

  • Experience developing on top of Kubernetes or similar distributed systems.

    • Kubernetes controller or operator development experience is a huge plus.

  • Proficiency operating Linux, with a solid understanding of cgroups, namespaces, and other multi-tenancy primitives.

  • Strong troubleshooting capabilities surrounding both systems and software.

  • Experience engineering large systems, tracking work, and being a self-starter on projects.

  • Excellent communication skills to collaborate with a service-oriented team and company.

Benefits:

  • Comprehensive Healthcare Benefits

  • 401k Matching

  • Workspace benefits for your home office

  • Personal & Professional development funds

  • Family Planning Support

  • Flexible Vacation (please use them!) & Reddit Global Wellness Days

  • 4+ months paid Parental Leave

  • Paid Volunteer time off

#LI-remote #LI-JS5

Date Posted: 04/12/2024
