open-edge-platform/datumaro
A Python framework and CLI tool for building, converting, and analyzing computer vision datasets for deep learning.

Velocity · 7d
+0.3
★ / day
Trend
→steady
star history
Datumaro is a dataset management library that enables reading, writing, and converting between popular computer vision dataset formats including COCO, VOC, YOLO, ImageNet, and Cityscapes. It provides CLI tooling and Python APIs for dataset transformation, statistics analysis, and pipeline orchestration. The framework integrates with annotation tools and connects to model training workflows.