alexiglad/EBT
A PyTorch implementation of Energy-Based Transformers, a novel transformer architecture enabling generalizable reasoning and scalable learning across modalities.

Velocity · 7d
+1.8
★ / day
Trend
→steady
star history
The repository provides code for training Energy-Based Transformers (EBTs), a new approach to transformer architecture that enables System 2 Thinking over every prediction. The method scales across modalities, data, depth, and parameters, demonstrating better generalization than standard transformer models. It includes training loops, model implementations, and experiments on language and video tasks.