Senior Data Engineer
Job Description
Team's Vision
The Machine Learning (ML) Engineering team at Disney drives and enables ML usage across several domains in heterogeneous language environments and at all stages of a project's life cycle, including ad-hoc exploration, preparing training data, model development, and robust production deployment. The team is invested in continual innovation of the ML infrastructure itself to carefully orchestrate a continuous cycle of learning, inference, and observation while also maintaining high system availability and reliability. We seek to find new ways to scale with our guest and partner base as well as the ever-growing need for ML and experiments.
Role
In this role you will partner with the ML Engineers and Data Scientists to help create and manage the datasets, contribute to the ML infrastructure by building and managing services that support and simplify ML development. You will conduct data exploration, feature engineering and build services. You will work on cross-functional projects and push the envelope on data and ML infrastructure.
Responsibilities:
Basic Qualifications:
Preferred Qualifications:
The hiring range for this position in California is $145,400 - $181,700 per year, in New York is $139,040 - $173,800 per year and Washington is $139,040 - $173,800 per year. The base pay actually offered will take into account internal equity and also may vary depending on the candidate's geographic region, job-related knowledge, skills, and experience among other factors. A bonus and/or long-term incentive units may be provided as part of the compensation package, in addition to the full range of medical, financial, and/or other benefits, dependent on the level and position offered.
The Machine Learning (ML) Engineering team at Disney drives and enables ML usage across several domains in heterogeneous language environments and at all stages of a project's life cycle, including ad-hoc exploration, preparing training data, model development, and robust production deployment. The team is invested in continual innovation of the ML infrastructure itself to carefully orchestrate a continuous cycle of learning, inference, and observation while also maintaining high system availability and reliability. We seek to find new ways to scale with our guest and partner base as well as the ever-growing need for ML and experiments.
Role
In this role you will partner with the ML Engineers and Data Scientists to help create and manage the datasets, contribute to the ML infrastructure by building and managing services that support and simplify ML development. You will conduct data exploration, feature engineering and build services. You will work on cross-functional projects and push the envelope on data and ML infrastructure.
Responsibilities:
- Design and develop data discovery tools, data quality and feature libraries
- Collaborate with ML practitioners to design and build data-forward solutions
- Deploy scalable streaming and batch data pipelines support petabyte scale datasets
- Build and maintain dimensional data, feature and model stores
- Ability to work on multi-faceted projects with engineers from diverse backgrounds, heterogenous skills and across teams.
- Drive and maintain a culture of quality, innovation and experimentation
- Work in an Agile environment that focuses on collaboration and teamwork
Basic Qualifications:
- 5+ years of software experience, with 3+ years of relevant data and software experience
- Experience in building large datasets and scalable services
- Experience deploying and running services in AWS, and engineering big-data solutions using technologies like Databricks, EMR, S3, Spark
- Experience loading and querying cloud-hosted databases such as Redshift and Snowflake
- Experience designing and developing backend microservices for large scale distributed systems using gRPC or REST.
- Experience with large-scale distributed data processing systems, cloud infrastructure such as AWS or GCP, and container systems such as Docker or Kubernetes.
Preferred Qualifications:
- Knowledge of the Python/Scala/Java data ecosystem
- Experience building streaming pipelines using Kafka, Spark, Flink, or Samza
- Excellent communication and people engagement skills
- Drive and maintain a culture of quality, innovation and experimentation
- Mentor colleagues on best practices and technical concepts of building large scale solutions
The hiring range for this position in California is $145,400 - $181,700 per year, in New York is $139,040 - $173,800 per year and Washington is $139,040 - $173,800 per year. The base pay actually offered will take into account internal equity and also may vary depending on the candidate's geographic region, job-related knowledge, skills, and experience among other factors. A bonus and/or long-term incentive units may be provided as part of the compensation package, in addition to the full range of medical, financial, and/or other benefits, dependent on the level and position offered.
Apply Now
Back to Job Listings
Add To Job List
Company Profile
View Company Reviews
Date Posted
10/29/2023
Views
3
Positive
Subjectivity Score: 0.9
Similar Jobs
Full Stack Software Engineer: Lead and Principal - Salesforce
Views in the last 30 days - 0
View DetailsExecutive Partnership Event, Senior Coordinator - Salesforce
Views in the last 30 days - 0
View DetailsLead Network Engineer - Backbone Engineering - Salesforce
Views in the last 30 days - 0
View DetailsSenior Business & Product Strategist- Workplace Services Education - Charles Schwab
Views in the last 30 days - 0
View Details