NVIDIA/apex

NVIDIA's PyTorch extension enabling mixed precision and distributed training for deep learning models.

9k stars · Python · ML Frameworks
Velocity (7d): +3.0 ★/day · Trend: steady

Apex provides NVIDIA-maintained utilities that streamline mixed-precision (FP16) and distributed training in PyTorch. It ships custom CUDA/C++ extensions for faster training, including fused kernels for attention and convolution operations. The project aims to make new training utilities available to PyTorch users before they are included upstream.
