← all repositories

sail-sg/Adan

Adaptive Nesterov momentum optimizer for training deep neural networks.

Adan
Velocity · 7d
+0.6
★ / day
Trend
steady
star history

Adan is a PyTorch-native optimization algorithm that adaptively adjusts momentum for faster convergence when training deep learning models. It is integrated into major ML frameworks including PyTorch, NeMo, TIMM, and PaddlePaddle. The optimizer is the default choice for training text-to-3D models (DreamFusion, Consistent3D), diffusion transformers (MDT), and language models.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.