← all repositories

ModelTC/LightCompress

A compression toolkit for large AI models including LLMs, VLMs, and diffusion generators.

LightCompress
Velocity · 7d
+0.9
★ / day
Trend
steady
star history

LightCompress provides off-the-shelf compression techniques for AI generated content models. It implements state-of-the-art algorithms including quantization, pruning, token merging, and token reduction to reduce model size and improve inference efficiency while maintaining model performance. The toolkit supports popular models like LLMs, VLMs, and diffusion-based video generators, with papers published at EMNLP 2024 and AAAI 2026.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.