ML Engineer, Reinforcement Learning from Human Feedback - US Remote
Company
Hugging Face
Location
Remote
Type
Full Time
Job Description
We have built the fastest-growing, open-source, library of pre-trained models in the world. With over 130K+ models and 110K+ stars on GitHub, over 10 thousand companies are using HF technology in production, including leading AI organizations such as Google, Elastic, Salesforce, Algolia, and Grammarly.
About the Role
As a machine learning engineer focused on Reinforcement Learning from Human Feedback (RLHF), you will work closely with researchers and engineers in Hugging Face's open reproduction team. From developing prototypes, to creating and monitoring experiments for designing new novel machine learning architectures, you will experience all cycles of typical industry research while executing a research agenda from start to finish.
This role is particularly well suited for someone who is looking to do research and engineering to build tools that will make RLHF accessible to many.
About you
You have a deep interest in conducting thorough research on a specific topic from the start to the end while working closely with the Hugging Face researcher. You have a passion for any topic related to RLHF: natural language processing, deep learning, reinforcement learning, synthetic data generation, and more.
Some of our requirements for this role:
- Working towards an MS or PhD degree in Computer Science or relevant field.
- Experience with PyTorch or any other major deep learning framework of choice.
- Experience with a domain(s) related to RLHF: natural language processing, reinforcement learning, synthetic data generation, or another related field.
- Problem solving and good communication skills.
- Some experience with Hugging Face's tools and ecosystem.
If you're interested in joining us, but don't tick every box above, we still encourage you to apply! We're building a diverse team whose skills, experiences, and backgrounds complement one another. We're happy to consider where you might be able to make the biggest impact.
More about Hugging Face
We are actively working to build a culture that values diversity, equity, and inclusivity. We are intentionally building a workplace where people feel respected and supported-regardless of who you are or where you come from. We believe this is foundational to building a great company and community. Hugging Face is an equal opportunity employer and we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
We value development. You will work with some of the smartest people in our industry. We are an organization that has a bias for impact and is always challenging ourselves to grow continuously. We provide all employees with reimbursement for relevant conferences, training, and education.
We care about your well-being. We offer flexible working hours and remote options. We offer health, dental, and vision benefits for employees and their dependents. We also offer 12 weeks of parental leave (20 for birthing mothers) and unlimited paid time off.
We support our employees wherever they are. While we have office spaces in NYC and Paris, we're very distributed and all remote employees have the opportunity to visit our offices. If needed, we'll also outfit your workstation to ensure you succeed.
We want our teammates to be shareholders. All employees have company equity as part of their compensation package. If we succeed in becoming a category-defining platform in machine learning and artificial intelligence, everyone enjoys the upside.
We support the community. We believe major scientific advancements are the result of collaboration across the field. Join a community supporting the ML/AI community.
Date Posted
04/07/2023
Views
8
Similar Jobs
Account Manager, Care Partnerships - Headway
Views in the last 30 days - 0
Headway a mental health care company founded in 2019 aims to revolutionize mental healthcare by building a national network of providers accepting ins...
View DetailsDirector of Pricing - Garner Health
Views in the last 30 days - 0
Garner Health is a rapidly growing company backed by toptier venture capital firms Their mission is to transform the healthcare economy by delivering ...
View DetailsDirector, Product, Customer, and Lifecycle Marketing - Garner Health
Views in the last 30 days - 0
Garner Health is seeking an experienced Product Marketing Leader to join their team The ideal candidate will lead the product marketing efforts focusi...
View DetailsLinux Support Engineer - Voltage Park
Views in the last 30 days - 0
Voltage Park is seeking a Linux Support Engineer for a fulltime remote position The ideal candidate will have command line level Linux sys administrat...
View DetailsData Analyst - Agero
Views in the last 30 days - 0
Agero a leading B2B whitelabel provider of digital driver assistance services is revolutionizing the vehicle ownership experience through datadriven t...
View DetailsDirector, Product (Remote) - Dscout
Views in the last 30 days - 0
Dscout is a leading company in experience research technology offering a platform for major companies to gain insights into user needs and behaviors T...
View Details