← all repositories

tensorflow/datasets

TensorFlow Datasets provides a catalog of hundreds of public datasets formatted as tf.data.Datasets for use with TensorFlow, Jax, and NumPy.

4.6k stars Python Data Tooling
datasets
Velocity · 7d
+1.6
★ / day
Trend
steady
star history

TFDS is a repository of ready-to-use machine learning datasets, including image, text, audio, and video collections. It provides standardized dataset loading, automatic downloads, and pre-processing utilities, allowing researchers and developers to quickly assemble data pipelines for model training. The library follows ML best practices including performance optimizations via tf.data and supports shuffle batching, prefetching, and caching.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.