← all repositories

ridgerchu/matmulfreellm

A language model architecture that eliminates matrix multiplication operations using efficient ternary weights, available as Hugging Face models.

matmulfreellm
Velocity · 7d
+4.0
★ / day
Trend
steady
star history

MatMul-Free LM replaces standard matrix multiplication with more efficient operations, reducing computational cost while maintaining language modeling capability. The repository provides custom model implementations compatible with the Hugging Face Transformers library, along with pre-trained checkpoints at 370M, 1.3B, and 2.7B parameter scales. It uses Triton for kernel optimization and follows scaling laws demonstrating efficiency advantages over standard Transformer architectures.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.