← all repositories

sgl-project/SpecForge

SpecForge is a training framework for speculative decoding models that can be deployed on the SGLang inference server.

SpecForge
Velocity · 7d
+2.4
★ / day
Trend
steady
star history

SpecForge enables training speculative decoding models such as Eagle and Eagle3 using PyTorch and FSDP distributed training. Speculative decoding accelerates LLM inference by having a smaller draft model propose tokens that a larger verifier model accepts or rejects. Once trained, the models can be ported to SGLang for production serving.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.