← all repositories

opendilab/awesome-RLHF

An awesome list aggregating research papers, codebases, and datasets on Reinforcement Learning with Human Feedback for language model alignment.

awesome-RLHF
Velocity · 7d
+3.6
★ / day
Trend
steady
star history

This repository compiles research papers and resources on Reinforcement Learning with Human Feedback (RLHF), a technique used to align large language models with human preferences. The collection is organized by year from 2020 to 2026 and includes codebases, datasets, blogs, and books relevant to the RLHF ecosystem. It serves as a reference for understanding how RLHF enables models like ChatGPT to better match complex human values.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.