← all repositories

mbzuai-oryx/Awesome-LLM-Post-training

A curated collection of papers, code implementations, benchmarks, and resources on LLM post-training methodologies for reasoning models.

2.4k stars Python LearningLanguage Models
Awesome-LLM-Post-training
Velocity · 7d
+5.1
★ / day
Trend
steady
star history

This repository aggregates the most influential works on post-training large language models, covering techniques like supervised fine-tuning, reinforcement learning from human feedback (RLHF), and reasoning enhancement. It serves as both a survey referenced by an arXiv paper and a practical guide for researchers and practitioners working on training and improving LLMs after pre-training.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.