pengzhiliang/MAE-pytorch
PyTorch implementation of Masked Autoencoders (MAE), a self-supervised vision transformer model for scalable visual learning.

Velocity · 7d
+1.6
★ / day
Trend
→steady
star history
This repository provides an unofficial PyTorch implementation of the MAE (Masked Autoencoders Are Scalable Vision Learners) paper. It implements the pretraining and fine-tuning processes for a vision transformer that learns by masking and reconstructing patches from images. The implementation includes shuffle/unshuffle operations for masked token handling and supports sine-cosine positional embeddings as described in the original paper.