TsinghuaC3I/Awesome-RL-for-LRMs
A comprehensive survey and paper collection on reinforcement learning methods applied to large reasoning models.

Velocity (7d): +5.5 ★/day · Trend: steady
This repository hosts a survey paper and a curated list of research on applying reinforcement learning to Large Reasoning Models (LRMs). It catalogs papers on techniques such as RLHF and GRPO, along with reasoning-focused training methods for advanced LLMs. Topics covered include chain-of-thought reasoning, process reward models, and multi-agent RL systems for improving LRMs.