visual-layer/fastdup
An open-source tool for analyzing image and video datasets to detect duplicates, similarities, and outliers using deep learning.

Velocity · 7d
+1.2
★ / day
Trend
→steady
star history
Fastdup is a dataset curation tool that uses deep learning to analyze large image and video collections at scale. It identifies duplicate images, visually similar content, and outliers to help improve data quality for machine learning pipelines. The tool integrates into data preparation workflows to reduce costs and enhance label quality.