← all repositories

HKUDS/LightReasoner

A post-training framework where small language models generate reasoning traces to teach large language models, improving their reasoning ability.

610 stars Python Language ModelsLLMOps · Eval
LightReasoner
Velocity · 7d
+2.4
★ / day
Trend
steady
star history

LightReasoner is an ACL 2026 Oral paper exploring knowledge distillation for reasoning in LLMs. The framework uses smaller models to generate and label reasoning data that is then used to fine-tune larger models, with a focus on token efficiency. It releases trained checkpoints and evaluates against reasoning benchmarks like Qwen2.5-Math baselines.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.