google-research/vision_transformer
A Google repository releasing pre-trained Vision Transformer and MLP-Mixer models for image classification, with JAX/Flax fine-tuning code.

Velocity (7d): +6.1 ★/day · Trend: steady
This repository releases pre-trained computer vision models, including Vision Transformer (ViT) and MLP-Mixer architectures, pre-trained on the ImageNet and ImageNet-21k datasets. It also provides JAX/Flax code for fine-tuning these models, so a released checkpoint can be adapted to a custom image classification task through transfer learning.
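
As a rough illustration of the transfer-learning idea, the sketch below trains a new linear classification head on top of fixed feature vectors in plain JAX. It is not the repository's API: the helper names (init_head, loss_fn, sgd_step, fine_tune_head) are hypothetical, random vectors stand in for features from a pre-trained backbone, and only the head is updated, which is a simpler setup than fine-tuning a whole model.

```python
import jax
import jax.numpy as jnp


def init_head(feature_dim, num_classes):
    """New linear head for the target task, zero-initialised for simplicity."""
    return {
        "w": jnp.zeros((feature_dim, num_classes)),
        "b": jnp.zeros((num_classes,)),
    }


def loss_fn(head, features, labels):
    """Mean softmax cross-entropy of the head on fixed backbone features."""
    logits = features @ head["w"] + head["b"]
    log_probs = jax.nn.log_softmax(logits)
    picked = jnp.take_along_axis(log_probs, labels[:, None], axis=1)
    return -jnp.mean(picked)


@jax.jit
def sgd_step(head, features, labels, lr):
    """One plain SGD update applied to the head parameters only."""
    loss, grads = jax.value_and_grad(loss_fn)(head, features, labels)
    new_head = jax.tree_util.tree_map(lambda p, g: p - lr * g, head, grads)
    return new_head, loss


def fine_tune_head(features, labels, num_classes, steps=100, lr=0.1):
    """Train a fresh head for `steps` SGD updates and record the loss."""
    head = init_head(features.shape[1], num_classes)
    losses = []
    for _ in range(steps):
        head, loss = sgd_step(head, features, labels, lr)
        losses.append(float(loss))
    return head, losses


# Toy data: random vectors standing in for features from a frozen,
# pre-trained backbone (an assumption made purely for illustration).
key = jax.random.PRNGKey(0)
features = jax.random.normal(key, (32, 16))
labels = jnp.arange(32) % 4
head, losses = fine_tune_head(features, labels, num_classes=4)
```

In a real workflow the features would come from a loaded checkpoint, and the backbone weights would typically be updated as well, usually with a smaller learning rate than a freshly initialised head needs.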