Job Description
About Us
At Viaduct, we are developing an end-to-end machine learning platform to empower automakers to build safer, more intelligent, and personalized vehicles. Our platform increases the accessibility and actionability of connected vehicle data for automakers and their partners and end-customers. We are a diverse team motivated to solve the hardest problems in the automotive industry.
Who You Are
You are a thoughtful engineer. Comfortable being a high level IC as you lead teams. You understand the complexities of distributed systems and how to triage and solve issues that arise with them. Scalability is top of mind when designing any system or writing code. You believe building a better ETL system requires close collaboration with the machine learning and data science teams. You avoid reinventing the wheel unless necessary and are excited by opportunities to contribute to the open-source community.
Expected Skills
- 3+ years as a data engineer
- Experience leading teams
- Proficiency in Python and SQL
- Hands on experience with Spark or equivalent technologies
- Working proficiency with a workflow scheduler (Airflow, Prefect, Argo, etc)
- Experience with distributed file-systems (HDFS, S3, etc)
- Familiar with the tools in open-source data ecosystem (Apache, CNCF, etc)
- Experience with incremental or real-time processing (Delta Lake, Apache Hudi, Kafka Stream, Spark Streaming, etc)
Security and Privacy Responsibilities
- Follow our policy and procedure documents related to security and privacy
- Follow the guidelines in the Employee Handbook
- Participate in new hire and annual training for security and privacy
- Treat data security and privacy as one of your primary job responsibilities
About the Role
Day 5
- Learn about Viaduct’s history and mission
- Get to know every team member
- Set up your development environment
- Understand Viaduct’s ETL pipelines and run your first DAGs
- Deep dive into the nuances of vehicle data
- Attend our weekly ML lunch
Day 30
- Take ownership of ETL pipelines
- Identify scalability bottlenecks in the existing ETL pipelines
- Be familiar with the day-to-day work of machine learning engineers and data scientists
- Learn the architecture of data engineering systems and services
Day 90
- Be the ETL pipeline expert at Viaduct
- Improve overall data quality and discoverability
- Confident in the scalability of Viaduct’s ETL pipelines
- Present your work at our weekly ML lunch
- Comfortable contributing to our engineering infrastructure and systems
Date Posted
09/29/2022
Views
6
Similar Jobs
Senior Staff Simulation Engineer - Wisk
Views in the last 30 days - 0
Wisk Aero is seeking a Senior Staff Simulation Engineer to join their Flight Physics Vehicle Modeling FPVM team The role involves designing implementi...
View DetailsSenior Simulation Software Integration Engineer - Wisk
Views in the last 30 days - 0
Wisk is seeking a Senior Simulation Software Integration Engineer to lead the integration of highfidelity simulation models develop advanced test fram...
View DetailsStaff Data Engineer - AiDash
Views in the last 30 days - 0
AiDASH is a Series C climate tech startup offering a fullstack SaaS solution for making critical infrastructure industries climateresilient and sustai...
View DetailsSupport Engineer - Pricefx
Views in the last 30 days - 0
Pricefx a leading SaaS Pricing Price Optimization Management provider is seeking a Tier 34 Support Engineer The role involves providing technical sup...
View DetailsAvionics Mechanical Engineer (Harness) - Reliable Robotics Corporation
Views in the last 30 days - 0
Reliable Robotics is seeking an Avionics Mechanical Engineer to join their Vehicle Design and Integration team in Mountain View California The role in...
View DetailsSr. Flight Software Engineer (Verification) - Reliable Robotics Corporation
Views in the last 30 days - 0
Reliable Robotics is a team of missiondriven engineers developing safetyenhancing technology for aviation aiming to make air transportation safer more...
View Details