← all repositories

cybertronai/gradient-checkpointing

A TensorFlow library that reduces neural network training memory usage from O(n) to O(sqrt(n)) by checkpointing computation graph nodes and recomputing intermediate values during backpropagation.

2.8k stars Python ML Frameworks
gradient-checkpointing
Velocity · 7d
+0.9
★ / day
Trend
steady
star history

This repository provides a memory optimization technique for training deep neural networks in TensorFlow. It works by strategically placing checkpoints in the computation graph and recomputing intermediate activations during backpropagation instead of storing them all in memory. The implementation uses the TensorFlow graph editor to automatically rewrite the backward pass computation graph. For feed-forward networks, this allows fitting models more than 10x larger onto a GPU with approximately 20% additional computation time.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.