Software Engineer - Data Infrastructure

Reddit · USA

Company

Reddit

Location

USA

Type

Full Time

Job Description

The Data Infrastructure team is looking to hire a Software Engineer who is excited to solve large-scale batch and streaming data challenges.

Reddit’s mission is to bring community and belonging to everyone in the world. Reddit is a community of communities where people can dive into anything through experiences built around their interests, hobbies, and passions. With more than 50 million people visiting 100,000+ communities daily, it is home to the most open and authentic conversations on the internet. From pets to parenting, skincare to stocks, there’s a community for everybody on Reddit. For more information, visit redditinc.com.

Our community of users generates over 150B analytics events per day, each of which is ingested by the Data Infrastructure team into a data warehouse that sees 55,000+ daily queries. We use this data to enable both batch and streaming data usage at the company. The team also owns our Streaming Platform, which is built using Flink.

As a software engineer, you will work with your team and partner teams like Machine Learning and Ads to create and improve scalable, fault-tolerant, self-serve systems. You will also:

  • Refine and maintain our data infrastructure technologies to support real-time analysis of hundreds of millions of users.

  • Own the data pipeline that surfaces 100B+ daily events to all teams, along with the tools we use for ingestion, storage, and improving data quality.

  • Build opinionated guardrails to drive improvements in data quality, cost efficiency, and data governance

  • Build software automation that connects our data services and surfaces metadata to downstream customers for discovery and data contract enforcement

  • Provide monitoring and alerting for our core systems and the mechanisms built on top of them

If you have a passion for building and maintaining high-quality code, want to improve how Reddit makes strategic decisions at the company level, and are excited about applying engineering best practices to one of the most powerful corpora of data in the world, then this is the team for you!

In your day-to-day you can expect to:

  • Collaborate effectively with a team of proficient software engineers to develop and maintain the fundamental platform that powers Reddit's cutting-edge data infrastructure

  • Engage in the complete data lifecycle at Reddit, participating in the development process and working with one of the world's largest and richest datasets.

  • Design, build, and deliver end-to-end data solutions to improve the reliability, scalability, latency, and efficiency of Reddit’s Data Platform

  • Implement automation for key elements of the development process, including data quality checks, alert management, and critical infrastructure operations.

  • Share on-call responsibilities, including incident management

Who you might be:

  • 3+ years of software engineering experience in a production setting, writing clean, maintainable, and well-tested code

  • Proficient in object-oriented programming languages like Scala, Python, Go, or Java.

  • Demonstrated expertise in designing and implementing large-scale systems, diligently monitoring project progress, and showing proactive leadership as a self-starter on diverse projects

  • Experience with cloud services, Terraform, Airflow, Kubernetes, CI/CD, Flink, and modern cloud-based infrastructure

  • Excellent communication skills tailored for effective collaboration within both a service-oriented team and the broader organizational context

Benefits:

  • Comprehensive Healthcare Benefits

  • 401k Matching

  • Workspace benefits for your home office

  • Personal & Professional development funds

  • Family Planning Support

  • Flexible Vacation (please use it!) & Reddit Global Wellness Days

  • 4+ months paid Parental Leave

  • Paid Volunteer time off

Date Posted

08/09/2024
