THUDM/LongWriter
LongWriter extends long-context LLMs to generate 10,000+ word outputs through supervised fine-tuning and RL-based training.

Velocity · 7d
+2.8
★ / day
Trend
→steady
star history
LongWriter addresses the limitation that LLMs struggle to produce very long outputs despite their long-context understanding. The project provides trained models (LongWriter-Zero-32B) and training code using both supervised fine-tuning on synthetic data and reinforcement learning (LongWriter-Zero). It supports fast inference via vLLM integration and includes an AgentWrite pipeline for multi-step planning during writing tasks.