← all repositories

unsplash/datasets

A public dataset of over 6.5 million high-quality Unsplash photos with keywords and search data for machine learning research.

2.7k stars Jupyter Notebook Data Tooling
datasets
Velocity · 7d
+1.3
★ / day
Trend
steady
star history

The Unsplash Dataset provides hundreds of thousands to millions of photos sourced from the Unsplash platform, along with associated keywords and search queries, for non-commercial and commercial research use. The Lite dataset offers 25k photos for commercial use while the Full dataset contains 6.5M+ photos for non-commercial use. The repository includes Jupyter Notebooks and documentation to help researchers download and work with the structured image data for computer vision, semantic search, and other ML tasks.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.