
visual-layer/fastdup

An open-source tool for analyzing image and video datasets to detect duplicates, similarities, and outliers using deep learning.

1.9k stars · Python · Data Tooling · Computer Vision
Velocity (7d): +1.2 ★/day · Trend: steady

Fastdup is a dataset curation tool that uses deep learning to analyze large image and video collections at scale. It identifies duplicate images, visually similar content, and outliers to help improve data quality for machine learning pipelines. The tool integrates into data preparation workflows to reduce costs and enhance label quality.
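To make the idea of duplicate and outlier detection concrete, here is a minimal sketch in plain Python. It is not fastdup's API or its implementation. It assumes each image has already been turned into an embedding vector by some model, and the function name, the thresholds, and the toy vectors are all hypothetical. Pairs whose cosine similarity is at or above `dup_threshold` are reported as duplicates, and images whose closest neighbour falls below `outlier_threshold` are reported as outliers.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def find_duplicates_and_outliers(embeddings, dup_threshold=0.98,
                                 outlier_threshold=0.5):
    """Brute-force pairwise comparison over a {name: vector} mapping.

    Hypothetical helper for illustration only; both thresholds are
    assumptions, not values taken from fastdup.
    """
    names = sorted(embeddings)
    duplicates = []
    # Highest similarity each image has to any other image.
    best = {name: -1.0 for name in names}
    for i, a in enumerate(names):
        for b in names[i + 1:]:
            sim = cosine(embeddings[a], embeddings[b])
            best[a] = max(best[a], sim)
            best[b] = max(best[b], sim)
            if sim >= dup_threshold:
                duplicates.append((a, b))
    # An image is an outlier if nothing in the set resembles it.
    outliers = [name for name in names if best[name] < outlier_threshold]
    return duplicates, outliers


# Toy embeddings (made up); real ones would come from a vision model.
toy = {
    "cat_1.jpg": [0.9, 0.1, 0.0],
    "cat_1_copy.jpg": [0.9, 0.1, 0.0],
    "cat_2.jpg": [0.8, 0.3, 0.1],
    "noise.jpg": [0.0, 0.1, 0.95],
}
duplicates, outliers = find_duplicates_and_outliers(toy)
print(duplicates)
print(outliers)
```

The pairwise loop is quadratic in the number of images, so it only suits small sets. Working at the scale the description mentions would typically call for an approximate nearest-neighbour index instead.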
