← all repositories

BlinkDL/RWKV-LM

An RNN-based foundation language model achieving transformer-level performance with linear-time inference and no kv-cache requirements.

RWKV-LM
Velocity · 7d
+8.2
★ / day
Trend
steady
star history

RWKV is a novel neural network architecture that merges RNN and transformer characteristics, enabling parallelizable training like GPT while achieving LLM-quality results with linear time complexity and constant memory usage. The project provides complete training infrastructure, pre-trained model weights on HuggingFace, and efficient inference support including GGUF quantization for various deployment scenarios.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.