← all repositories

TsinghuaC3I/Awesome-RL-for-LRMs

A comprehensive survey and paper collection on reinforcement learning methods applied to large reasoning models.

2.5k stars TeX LearningLanguage Models
Awesome-RL-for-LRMs
Velocity · 7d
+5.5
★ / day
Trend
steady
star history

This repository hosts a survey paper and curated list of research works on applying reinforcement learning to Large Reasoning Models (LRMs). It catalogs papers on techniques like RLHF, GRPO, and reasoning-focused training methods for advanced LLMs. The collection covers topics including chain-of-thought reasoning, process reward models, and multi-agent RL systems for LRM improvement.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.