ridgerchu/matmulfreellm
A language model architecture that eliminates matrix multiplication by using ternary weights, with pre-trained models available through Hugging Face.

MatMul-Free LM replaces standard matrix multiplication with ternary-weight operations: because each weight is -1, 0, or +1, multiplying by a weight reduces to subtracting, skipping, or adding the input, which lowers computational cost while retaining language modeling capability. The repository provides custom model implementations compatible with the Hugging Face Transformers library, along with pre-trained checkpoints at 370M, 1.3B, and 2.7B parameters. It uses Triton for its optimized kernels and reports scaling-law results that indicate efficiency advantages over standard Transformer architectures.
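
To make the ternary-weight idea concrete, here is a minimal pure-Python sketch of a linear layer whose inner loop uses only additions and subtractions. It is an illustration, not the repository's implementation: the function names `ternarize` and `ternary_linear` are hypothetical, and the absmean scaling rule is an assumption chosen for simplicity. The actual kernels in the repository are written with Triton.

```python
def ternarize(weights):
    """Quantize a weight matrix to {-1, 0, +1} plus one shared scale.

    Assumption: the scale is the mean absolute weight (absmean); the
    repository may use a different rule.
    """
    flat = [abs(w) for row in weights for w in row]
    scale = sum(flat) / len(flat) or 1.0  # fall back to 1.0 for all-zero weights
    ternary = [[max(-1, min(1, round(w / scale))) for w in row] for row in weights]
    return ternary, scale


def ternary_linear(x, ternary, scale):
    """Compute y = scale * (W_ternary @ x) without multiplying by weights."""
    out = []
    for row in ternary:
        acc = 0.0
        for t, xi in zip(row, x):
            if t == 1:
                acc += xi   # weight +1: add the input
            elif t == -1:
                acc -= xi   # weight -1: subtract the input
            # weight 0: skip the input entirely
        out.append(acc * scale)  # one rescale per output element
    return out


weights = [[0.9, -0.1, -1.2], [0.05, 0.8, -0.7]]
ternary, scale = ternarize(weights)
y = ternary_linear([2.0, 3.0, 5.0], ternary, scale)
```

The only multiplication left is the single rescale per output element; the per-weight work is an add, a subtract, or nothing, which is what makes this formulation cheaper than a dense floating-point matrix product.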