Job Description
We're looking for a savvy and experienced Senior Data Engineer to join the Data Platform Engineering team at Hims. As a Senior Data Engineer you will work with the analytics engineers product managers engineers security DevOps analytics and machine learning teams to build a data platform that backs the self-service analytics machine learning models and data products serving over a million Hims & Hers users.
You Will:
-
Architect and develop data pipelines to optimize performance quality and scalability
-
Build maintain & operate scalable performant and containerized infrastructure required for optimal extraction transformation and loading of data from various data sources
-
Design develop and own robust scalable data processing and data integration pipelines using Python dbt Kafka Airflow PySpark SparkSQL and REST API endpoints to ingest data from various external data sources to Data Lake
-
Develop testing frameworks and monitoring to improve data quality observability pipeline reliability and performance
-
Orchestrate sophisticated data flow patterns across a variety of disparate tooling
-
Support analytics engineers data analysts and business partners in building tools and data marts that enable self-service analytics
-
Partner with the rest of the Data Platform team to set best practices and ensure the execution of them
-
Partner with the analytics engineers to ensure the performance and reliability of our data sources
-
Partner with machine learning engineers to deploy predictive models
-
Partner with the legal and security teams to build frameworks and implement data compliance and security policies
-
Partner with DevOps to build IaC and CI/CD pipelines
-
Support code versioning and code deployments for data Pipelines
You Have:
-
8+ years of professional experience designing creating and maintaining scalable data pipelines using Python API calls SQL and scripting languages
-
Demonstrated experience writing clean efficient & well-documented Python code and are willing to become effective in other languages as needed
-
Demonstrated experience writing complex highly optimized SQL queries across large data sets
-
Experience with cloud technologies such as AWS and/or Google Cloud Platform
-
Experience building event streaming pipelines using Kafka/Confluent Kafka
-
Experience with IaC technologies like Terraform
-
Experience with data warehouses like BigQuery Databricks Snowflake and Postgres
-
Experience with Databricks platform
-
Experience with modern data stack like Airflow/Astronomer Databricks dbt Fivetran Confluent Tableau/Looker
-
Experience with containers and container orchestration tools such as Docker or Kubernetes
-
Experience with Machine Learning & MLOps
-
Experience with CI/CD (Jenkins GitHub Actions Circle CI)
-
Thorough understanding of SDLC and Agile frameworks
-
Project management skills and a demonstrated ability to work autonomously
Nice to Have:
-
Experience building data models using dbt
-
Experience with Javascript and event tracking tools like GTM
-
Experience designing and developing systems with desired SLAs and data quality metrics
-
Experience with microservice architecture
-
Experience architecting an enterprise-grade data platform
Date Posted
06/28/2024
Views
3
Similar Jobs
Staff Salesforce Engineer - CRM Systems - GitLab
Views in the last 30 days - 0
This job description outlines a Staff Salesforce Developer role focusing on designing building and scaling enterprisegrade solutions across Salesforce...
View DetailsSoftware Engineer III | Platform - ExtraHop
Views in the last 30 days - 0
This job posting seeks a Software Engineer III to develop features lead junior team members and contribute to secure cloud and appliance solutions The...
View DetailsDevOps Engineer - Guidehouse
Views in the last 30 days - 0
This job posting seeks a skilled DevOps Engineer to support development QA and operations across applications emphasizing automation cloudnative infra...
View DetailsData Scientist - Capstone Integrated Solutions
Views in the last 30 days - 0
Capstone Integrated Solutions promotes itself as a customerfocused provider offering comprehensive software services and seeks a Data Scientist with e...
View DetailsEngineering Manager - Software Supply Chain Security: Auth Infrastructure - GitLab
Views in the last 30 days - 0
This job description highlights a leadership role in developing secure scalable authentication infrastructure for GitLab It emphasizes technical exper...
View DetailsGrowth Product Lead - Loyalty - Trafilea
Views in the last 30 days - 0
Trafilea promotes itself as a transformative consumer tech platform with AIdriven growth solutions highlighting achievements like 1B revenue and globa...
View Details