← all repositories

changyeyu/LLM-RL-Visualized

A repository of over 100 original SVG architecture diagrams explaining large language models, vision-language models, and reinforcement learning algorithms.

4.5k stars Python LearningLanguage Models
LLM-RL-Visualized
Velocity · 7d
+11
★ / day
Trend
steady
star history

The repository provides a large collection of hand-drawn SVG diagrams and PDF maps covering the architecture of LLMs and VLMs, training algorithms including RLHF, GRPO, DPO, SFT, and chain-of-thought distillation, and reinforcement learning fundamentals like policy gradient and PPO. It serves as a visual study guide complementing the published book 《大模型算法:强化学习、微调与对齐》. Topics span transformer structures, attention mechanisms, reward modeling, and RAG optimization.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.