Principal AI Data Engineer
Company
Grindr
Location
Remote
Type
Full Time
Job Description
This is a hybrid role based in our Palo Alto or San Francisco offices and will require you to be in office Tuesdays and Thursdays.
What’s so interesting about this role?
We at Grindr believe that AI can revolutionize the dating industry. Our Data Engineer lead is responsible for building high quality ML datasets at scale, used to train ML models that power AI-centric features. In this pivotal role, you will have the opportunity to build foundational tools and data pipelines to ingest, normalize and clean the valuable data that would be fundamental for for our ML engineers in Grindr to build AI tools including recommendations, LLMs, ads, visual search, growth/notifications, trust and safety.Â
What’s the job?
We’re looking for an exceptional data engineer who is passionate about data for AI and values it can bring to Grindr, Who loves working with data ops at scale; and who is committed to the hard work necessary to continuously improve our ML data pipelines.
In this position, you will be responsible for establishing and executing the strategy for our organization’s ML Data Engine, with an initial focus on agile ML Data OPs. This includes identification of infrastructure components and data stack to be used, design and implementation of pipelines between data systems and teams, automation workflows, data enrichment and monitoring tools all for AI models. As a tech lead specialized in data engineering, you are expected to code and contribute to the stack.
Responsibilities:
- Dive into our dataset and design, implement and scale data pre/post processing pipelines of ML models
- Work on applied ML solutions in the areas of data mining, cleaning, normalizing and modeling
- Be self-motivated in seeking solutions when the correct path isn’t always known
- Collaborate with engineers in conceptualizing, planning and implementing data engineering initiatives working with different stakeholdersÂ
- Design and build data platforms & frameworks for processing high volumes of data, in real time as well as batch, that will be used across engineering teamsÂ
- Build data processing streams for cleaning and modeling text data for LLMs
- Research and evaluate new technologies in the big data space to guide our continuous improvementÂ
- Collaborate with multi-functional teams to help tune the performance of large data applicationsÂ
- Work with Privacy and Security team on data governance, risk and compliance initiativesÂ
- Work on initiatives to ensure stability, performance and reliability of our data infrastructureÂ
What we’ll love about youÂ
- Bachelors in Computer Science, Mathematics, Physics, or a related fields
- 5+ years of experience as a data engineer building production-level pre/post-processing data pipelines for ML/DL models, including 2+ years of technical leadership experience
- Experience in statistical analysis & visualization on datasets using Pandas or R
- Experience designing and building highly available, distributed systems of data extraction, ingestion, normalization and processing of large data sets in real time as well as batch, that will be used across engineering teams using orchestration frameworks like Airflow, KubeFlow or other pipeline tools
- Demonstrated prior experience in creating data pipelines for text data sets NLP/ large language models
- Ability to produce well-engineered software, including appropriate automated test suites, technical documentation, and operational strategy
- Excellent coding skills in Python, Java, bash, SQL, and expertise with Git version control
- Experience using big data technologies (Snowflake, Airflow, Kubernetes, Docker, Helm, Spark, pySpark)
- Experience with any public cloud environment - AWS, GCP or Azure
- Significant experience with relational databases and query authoring (SQL) as well as  NoSQL databases like DynamoDB etc
- Experience building and maintaining ETL (managing high-quality reliable ETL pipelines)
We’ll really swoon if you have   Â
- 2+ years of experience of technical leadership in building data engineering pipelines for AI
- Previous experience in building data pipeline for conversational AI APIs and recommender systems
- Experience with distributed systems and microservices
- Experience with Kubernetes and building Docker images
- Experience with building stream-processing systems, using solutions such as Kafka, Storm or Spark-Streaming
- Strong understanding of applied machine learning topics
- Be familiar with legal compliance (with data management tools) data classification, and retention
- Consistent track record of managing and implementing complex data projects
What you'll love about us
- Mission and Impact: Grindr is the world-leading LGBTQ social networking service. Your role will impact the lives of millions of LGBTQ people around the world
- Multiple Locations: We are hiring someone for this role to be based ideally in San Francisco or Palo Alto
- Family Insurance: Insurance premium coverage for health, dental, and vision for you and partial coverage for your dependents
- Retirement Savings: Generous 401K plan with 6% match and immediate vest in the US
- Compensation: Industry-competitive compensation and eligibility for company bonus and equity programs
- Queer-Inclusive Benefits: Industry-leading gender-affirming offerings with up to 90% cost coverage, access to Included Health, monthly stipends for HRT, and more
- Additional Benefits: Flexible vacation policy, monthly stipends for cell phone, internet, wellness, and food, one-time home-office setup stipend, and company-sponsored events
About Grindr
Grindr is the world’s largest dating app for gay, bi, trans, and queer people. With around 13 million monthly active users, Grindr has become a fundamental part of the global LGBTQ community, and we take pride in empowering our users to connect, express themselves, and discover the queer world around them.
Our next evolution is underway as a newly public company looking to expand and continue to build meaningful experiences for our users. From social issues to original content, we’re blazing innovative paths with impact for our community, all while leveraging the latest tech stacks and striving for engineering excellence. At the heart of our work in this new chapter is a shared set of core values: openness and exploration, a bias for action, and strong support of the LGBTQ community.Â
With a track record of strong financial performance and plans for continued headcount growth, we’re looking to build a team of talented, passionate, and open-minded people who believe in our mission, align with our values, and are excited to work at the intersection of innovative technology and social impact. Come be a part of this exciting journey with us.
Grindr is an equal-opportunity employer
To learn more about how we handle the personal data of applicants, visit our Employee and Candidate Privacy Policy.
Â
#LI-Hybrid
Grindr is committed to fair and equitable compensation practices. This base pay range is for the U.S. and is not applicable to locations outside of the U.S. The actual base pay is dependent upon many factors, such as training, transferable skills, work experience, business needs, location, and market demands. The base pay range is subject to change and may be modified in the future. This role will also be eligible for equity, benefits, and a company bonus program.
Date Posted
11/06/2023
Views
14
Similar Jobs
Account Manager, Care Partnerships - Headway
Views in the last 30 days - 0
Headway a mental health care company founded in 2019 aims to revolutionize mental healthcare by building a national network of providers accepting ins...
View DetailsDirector of Pricing - Garner Health
Views in the last 30 days - 0
Garner Health is a rapidly growing company backed by toptier venture capital firms Their mission is to transform the healthcare economy by delivering ...
View DetailsDirector, Product, Customer, and Lifecycle Marketing - Garner Health
Views in the last 30 days - 0
Garner Health is seeking an experienced Product Marketing Leader to join their team The ideal candidate will lead the product marketing efforts focusi...
View DetailsLinux Support Engineer - Voltage Park
Views in the last 30 days - 0
Voltage Park is seeking a Linux Support Engineer for a fulltime remote position The ideal candidate will have command line level Linux sys administrat...
View DetailsData Analyst - Agero
Views in the last 30 days - 0
Agero a leading B2B whitelabel provider of digital driver assistance services is revolutionizing the vehicle ownership experience through datadriven t...
View DetailsDirector, Product (Remote) - Dscout
Views in the last 30 days - 0
Dscout is a leading company in experience research technology offering a platform for major companies to gain insights into user needs and behaviors T...
View Details