
alxndrTL/mamba.py

A pure PyTorch and MLX implementation of the Mamba state space model architecture for language modeling.

1.5k stars · Python · Language Models · ML Frameworks
Velocity · 7d: +1.7 ★ / day
Trend: steady

This repository provides a clean, efficient implementation of Mamba in pure PyTorch with parallel scan for faster training and inference. It includes support for related architectures like Jamba (Mamba + attention hybrid) and Vision Mamba, as well as muP (maximal update parameterization) for better hyperparameter transfer across model scales. The implementation is now integrated into the Hugging Face transformers library and supports both CUDA and MLX backends.
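To give a feel for what "parallel scan" means here, the following is a minimal pure-Python sketch, not the repository's code. It assumes a scalar recurrence of the form h_t = a_t * h_{t-1} + b_t with h starting at 0, which is a simplified stand-in for the state update in a state space model. The function names (`sequential_scan`, `combine`, `parallel_scan`) are hypothetical and chosen for illustration only. The sketch uses a Hillis-Steele style scan; the repository may use a different scan variant.

```python
def sequential_scan(a, b):
    """Reference recurrence: h_t = a_t * h_{t-1} + b_t, starting from h = 0."""
    h, out = 0.0, []
    for a_t, b_t in zip(a, b):
        h = a_t * h + b_t
        out.append(h)
    return out


def combine(left, right):
    """Compose two affine maps h -> a*h + b, with `left` applied first."""
    a_l, b_l = left
    a_r, b_r = right
    return (a_l * a_r, a_r * b_l + b_r)


def parallel_scan(a, b):
    """Inclusive Hillis-Steele scan over (a_t, b_t) pairs.

    Each round reads only the previous round's values, so every position
    in a round could be updated at the same time on parallel hardware.
    About log2(n) rounds are needed instead of n sequential steps.
    """
    elems = list(zip(a, b))
    n = len(elems)
    step = 1
    while step < n:
        elems = [
            combine(elems[i - step], elems[i]) if i >= step else elems[i]
            for i in range(n)
        ]
        step *= 2
    # Applying each composed map to h = 0 leaves just its offset term.
    return [b_t for _, b_t in elems]
```

The key point is that composing two affine updates is associative, so the updates can be grouped in any order; that is what lets the sequence dimension be processed in a logarithmic number of rounds. A real implementation would operate on tensors and also define a backward pass.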
