sgl-project/SpecForge
SpecForge is a training framework for speculative decoding models that can be deployed on the SGLang inference server.

Velocity · 7d
+2.4
★ / day
Trend
→steady
star history
SpecForge enables training speculative decoding models such as Eagle and Eagle3 using PyTorch and FSDP distributed training. Speculative decoding accelerates LLM inference by having a smaller draft model propose tokens that a larger verifier model accepts or rejects. Once trained, the models can be ported to SGLang for production serving.