ML Engineer, Reinforcement Learning from Human Feedback

ML Engineer, Reinforcement Learning from Human Feedback

Here at Hugging Face, we’re on a journey to advance good Machine Learning and make it more accessible. Along the way, we contribute to the development of technology for the better.

We have built the fastest-growing, open-source, library of pre-trained models in the world. With over 130K+ models and 110K+ stars on GitHub, over 10 thousand companies are using HF technology in production, including leading AI organizations such as Google, Elastic, Salesforce, Algolia, and Grammarly.

About the Role

As a machine learning engineer focused on Reinforcement Learning from Human Feedback (RLHF), you will work closely with researchers and engineers in Hugging Face's open reproduction team. From developing prototypes, to creating and monitoring experiments for designing new novel machine learning architectures, you will experience all cycles of typical industry research while executing a research agenda from start to finish.

This role is particularly well suited for someone who is looking to do research and engineering to build tools that will make RLHF accessible to many.

About you

You have a deep interest in conducting thorough research on a specific topic from the start to the end while working closely with the Hugging Face researcher. You have a passion for any topic related to RLHF: natural language processing, deep learning, reinforcement learning, synthetic data generation, and more.

Some of our requirements for this role:

  • Working towards an MS or PhD degree in Computer Science or relevant field.
  • Experience with PyTorch or any other major deep learning framework of choice.
  • Experience with a domain(s) related to RLHF: natural language processing, reinforcement learning, synthetic data generation, or another related field.
  • Problem solving and good communication skills.
  • Some experience with Hugging Face's tools and ecosystem.

If you're interested in joining us, but don't tick every box above, we still encourage you to apply! We're building a diverse team whose skills, experiences, and backgrounds complement one another. We're happy to consider where you might be able to make the biggest impact.


More about Hugging Face

We are actively working to build a culture that values diversity, equity, and inclusivity. We are intentionally building a workplace where people feel respected and supported—regardless of who you are or where you come from. We believe this is foundational to building a great company and community. Hugging Face is an equal opportunity employer and we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

We value development. You will work with some of the smartest people in our industry. We are an organization that has a bias for impact and is always challenging ourselves to grow continuously. We provide all employees with reimbursement for relevant conferences, training, and education.

We care about your well-being. We offer flexible working hours and remote options. We offer health, dental, and vision benefits for employees and their dependents. We also offer 12 weeks of parental leave (20 for birthing mothers) and unlimited paid time off.

We support our employees wherever they are. While we have office spaces in NYC and Paris, we're very distributed and all remote employees have the opportunity to visit our offices. If needed, we'll also outfit your workstation to ensure you succeed.

We want our teammates to be shareholders. All employees have company equity as part of their compensation package. If we succeed in becoming a category-defining platform in machine learning and artificial intelligence, everyone enjoys the upside.

We support the community. We believe major scientific advancements are the result of collaboration across the field. Join a community supporting the ML/AI community.

Logos/outerjoin logo full

Outer Join is the premier job board for remote jobs in data science, analytics, and engineering.