← all repositories

AIoT-MLSys-Lab/Efficient-LLMs-Survey

A comprehensive academic survey on efficiency techniques for large language models, published in TMLR 2024.

Efficient-LLMs-Survey
Velocity · 7d
+1.1
★ / day
Trend
steady
star history

This repository hosts a peer-reviewed survey paper covering methods for improving the efficiency of large language models, including techniques for model compression, quantization, distillation, and system optimization. It compiles and categorizes research across training, inference, and architectural improvements for LLMs.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.