← all repositories

SafeAILab/EAGLE

Speculative decoding method for fast LLM inference with provable performance maintenance.

EAGLE
Velocity · 7d
+2.6
★ / day
Trend
steady
star history

EAGLE (Extrapolation Algorithm for Greater Language-model Efficiency) accelerates LLM generation by extrapolating second-top-layer contextual feature vectors. It is evaluated by third parties as the fastest speculative decoding method, achieving 2x speedup over gpt-fast and 3x speedup over vanilla decoding on 13B models. The implementation covers EAGLE through EAGLE-3 across three major ML/AI conferences.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.