
NX-AI/xlstm

A recurrent neural network architecture extending LSTM with exponential gating and matrix memory for language modeling.

2.2k stars · Python · Language Models · ML Frameworks
Velocity (7d): +2.8 ★ / day · Trend: steady

xLSTM is a recurrent neural network (RNN) architecture that builds on the original LSTM. It adds exponential gating, with normalization and stabilization techniques, and a matrix memory to overcome limitations of the traditional LSTM. On language modeling tasks, the architecture shows performance competitive with Transformers and State Space Models. The repository includes training code and a 7B-parameter xLSTM Large model trained on 2.3T tokens.
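
To make the two ideas named above more concrete, here is a minimal pure-Python sketch of one recurrent step of a matrix-memory cell whose exponential gates are kept stable by working in log space. It is written from the update equations in the xLSTM paper as understood here, and it is not the repository's API or implementation. The function name `mlstm_step`, its arguments, and the toy inputs are all hypothetical. It also assumes scalar gate pre-activations and equal key and value dimensions for brevity.

```python
import math

def mlstm_step(C, n, m, q, k, v, i_pre, f_pre):
    """One step of a matrix-memory cell with stabilized exponential gates.

    C: d x d matrix memory (list of rows), n: normalizer vector (length d),
    m: running log-scale stabilizer (float), q, k, v: query, key, value vectors,
    i_pre, f_pre: scalar pre-activations; the raw gates are exp(i_pre), exp(f_pre).
    All names here are illustrative assumptions, not the repository's API.
    """
    d = len(k)
    # Stabilization: track the largest log-gate seen so far and subtract it,
    # so that neither exponential below can overflow.
    m_new = max(f_pre + m, i_pre)
    i_gate = math.exp(i_pre - m_new)      # always <= 1
    f_gate = math.exp(f_pre + m - m_new)  # always <= 1
    k_scaled = [x / math.sqrt(d) for x in k]
    # Matrix memory: decay the old memory, then add the outer product v k^T.
    C_new = [[f_gate * C[r][c] + i_gate * v[r] * k_scaled[c] for c in range(d)]
             for r in range(d)]
    # Normalizer state: the same recurrence applied to the keys alone.
    n_new = [f_gate * n[c] + i_gate * k_scaled[c] for c in range(d)]
    # Readout: query the memory and divide by the normalizer term.
    # Simplification (assumption): the lower bound of 1 is applied directly
    # to the stabilized state.
    numerator = [sum(C_new[r][c] * q[c] for c in range(d)) for r in range(d)]
    denominator = max(abs(sum(n_new[c] * q[c] for c in range(d))), 1.0)
    h = [x / denominator for x in numerator]
    return C_new, n_new, m_new, h

# Toy usage: an input-gate pre-activation of 1000 would overflow a naive
# math.exp(1000), but the stabilized step stays finite.
d = 2
C0 = [[0.0] * d for _ in range(d)]
n0 = [0.0] * d
C1, n1, m1, h1 = mlstm_step(C0, n0, 0.0, q=[1.0, 0.0], k=[1.0, 0.0],
                            v=[3.0, 4.0], i_pre=1000.0, f_pre=0.0)
print(m1, h1)
```

The design point the sketch tries to show is that only the differences `i_pre - m_new` and `f_pre + m - m_new` are ever exponentiated, and both are non-positive, so the gates stay in (0, 1] however large the pre-activations become.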
