← all repositories

horseee/Awesome-Efficient-LLM

A curated collection of papers and projects on efficient LLM techniques including quantization, pruning, knowledge distillation, and inference acceleration.

2k stars Python LearningLLMOps · Eval
Awesome-Efficient-LLM
Velocity · 7d
+1.8
★ / day
Trend
steady
star history

This repository maintains an organized list of research papers and open-source projects focused on making large language models more efficient. It covers areas such as network pruning, model quantization, knowledge distillation, inference acceleration, efficient architectures, and KV cache compression. The list is structured by sub-topic with separate markdown files and includes a project directory for implementations.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.