mbzuai-oryx/Awesome-LLM-Post-training
A curated collection of papers, code implementations, benchmarks, and resources on LLM post-training methodologies for reasoning models.

Velocity · 7d
+5.1
★ / day
Trend
→steady
star history
This repository aggregates the most influential works on post-training large language models, covering techniques like supervised fine-tuning, reinforcement learning from human feedback (RLHF), and reasoning enhancement. It serves as both a survey referenced by an arXiv paper and a practical guide for researchers and practitioners working on training and improving LLMs after pre-training.