ML Engineer, Reinforcement Learning from Human Feedback - US Remote

Hugging Face • Remote

Company

Hugging Face

Location

Remote

Type

Full Time

Job Description

Here at Hugging Face, we're on a journey to advance good Machine Learning and make it more accessible. Along the way, we contribute to the development of technology for the better.

We have built the fastest-growing, open-source, library of pre-trained models in the world. With over 130K+ models and 110K+ stars on GitHub, over 10 thousand companies are using HF technology in production, including leading AI organizations such as Google, Elastic, Salesforce, Algolia, and Grammarly.

About the Role

As a machine learning engineer focused on Reinforcement Learning from Human Feedback (RLHF), you will work closely with researchers and engineers in Hugging Face's open reproduction team. From developing prototypes, to creating and monitoring experiments for designing new novel machine learning architectures, you will experience all cycles of typical industry research while executing a research agenda from start to finish.

This role is particularly well suited for someone who is looking to do research and engineering to build tools that will make RLHF accessible to many.

About you

You have a deep interest in conducting thorough research on a specific topic from the start to the end while working closely with the Hugging Face researcher. You have a passion for any topic related to RLHF: natural language processing, deep learning, reinforcement learning, synthetic data generation, and more.

Some of our requirements for this role:

Working towards an MS or PhD degree in Computer Science or relevant field.
Experience with PyTorch or any other major deep learning framework of choice.
Experience with a domain(s) related to RLHF: natural language processing, reinforcement learning, synthetic data generation, or another related field.
Problem solving and good communication skills.
Some experience with Hugging Face's tools and ecosystem.

If you're interested in joining us, but don't tick every box above, we still encourage you to apply! We're building a diverse team whose skills, experiences, and backgrounds complement one another. We're happy to consider where you might be able to make the biggest impact.

More about Hugging Face

We are actively working to build a culture that values diversity, equity, and inclusivity. We are intentionally building a workplace where people feel respected and supported-regardless of who you are or where you come from. We believe this is foundational to building a great company and community. Hugging Face is an equal opportunity employer and we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

We value development. You will work with some of the smartest people in our industry. We are an organization that has a bias for impact and is always challenging ourselves to grow continuously. We provide all employees with reimbursement for relevant conferences, training, and education.

We care about your well-being. We offer flexible working hours and remote options. We offer health, dental, and vision benefits for employees and their dependents. We also offer 12 weeks of parental leave (20 for birthing mothers) and unlimited paid time off.

We support our employees wherever they are. While we have office spaces in NYC and Paris, we're very distributed and all remote employees have the opportunity to visit our offices. If needed, we'll also outfit your workstation to ensure you succeed.

We want our teammates to be shareholders. All employees have company equity as part of their compensation package. If we succeed in becoming a category-defining platform in machine learning and artificial intelligence, everyone enjoys the upside.

We support the community. We believe major scientific advancements are the result of collaboration across the field. Join a community supporting the ML/AI community.

Date Posted

04/07/2023

Views

Back to Job Listings ❤️Add To Job List Company Info View Company Reviews

Positive

Subjectivity Score: 0.9

Similar Jobs

Senior Software Engineer - Mozilla

Views in the last 30 days - 0

Mozillas mission is to build an open internet focusing on privacy and innovation They offer impactful roles like Senior Software Engineer at AMO with ...

View Details

Cybersecurity Specialist - Red Team | Remote - Lumitekno Kreasi Global

Views in the last 30 days - 0

This job posting seeks a Cybersecurity Specialist Red Team member for remote security testing and system improvement The role involves realworld secur...

View Details

Senior Software Engineer - Mozilla

Views in the last 30 days - 0

Mozillas mission to improve the internet through opensource projects and innovation They seek a Senior Software Engineer to enhance AMO offering compe...

View Details

QA Automation Engineer - ActiveState

Views in the last 30 days - 0

The text describes a job opportunity for an Automation QA Engineer at ActiveState highlighting responsibilities involving automated testing frameworks...

View Details

Full Stack Software Engineer III Angular Java - MeridianLink

Views in the last 30 days - 0

This job posting seeks a Senior FullStack Software Engineer with expertise in Angular and Javabased backend development The role involves building res...

View Details

Solutions Architect - FireMon

Views in the last 30 days - 0

The text warns about a phishing attempt impersonating FireMon HR and outlines a Solutions Architect role with responsibilities in customer experience ...

View Details