BR-IDL/PaddleViT
A collection of vision transformer and MLP architectures for computer vision tasks including classification, object detection, semantic segmentation, and GAN, built on PaddlePaddle.

PaddleViT provides implementations of state-of-the-art Visual Transformers and MLP models for computer vision tasks such as image classification, object detection, semantic segmentation, and generative adversarial networks. The library integrates with the PaddlePaddle deep learning framework and includes model architectures, training/validation scripts, data augmentation utilities, and pretrained weights for fine-tuning.