NVIDIA/apex
NVIDIA's PyTorch extension enabling mixed precision and distributed training for deep learning models.

Velocity · 7d
+3.0
★ / day
Trend
→steady
star history
Apex provides NVIDIA-maintained utilities to streamline mixed precision (FP16) and distributed training in PyTorch. It offers custom CUDA/C++ extensions for optimized training performance, including fused kernels for attention and convolution operations. The project aims to make cutting-edge training utilities available to PyTorch users before upstream inclusion.