
GeorgeCazenavette/mtt-distillation

Research code for Dataset Distillation by Matching Training Trajectories, which learns synthetic images to replace real datasets for training computer vision models.

441 stars · Python · Computer Vision · Data Tooling
Velocity (7d): +0.3 ★ / day · Trend: steady

This repository implements a dataset distillation method published at CVPR 2022. The approach learns a small set of synthetic images such that models trained exclusively on them reach test performance similar to that of models trained on the full real dataset. It optimizes the synthetic data so that student networks trained on it follow training trajectories similar to those of expert networks trained on real data, with the mismatch measured as a distance in parameter space across training iterations.
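The parameter-space error described above can be illustrated with a minimal sketch. It assumes the normalized squared-distance form: the distance between the student's parameters after a few steps on synthetic data and a later expert checkpoint, divided by the distance the expert itself travelled. The function name `trajectory_matching_loss` and the flat-list parameter representation are hypothetical choices for illustration, not the repository's code.

```python
def trajectory_matching_loss(student_end, expert_start, expert_target):
    """Normalized squared distance between two points in parameter space.

    Hypothetical helper for illustration only.
    student_end:   student parameters after training on synthetic data,
                   having started from expert_start
    expert_start:  expert checkpoint the student was initialized from
    expert_target: later expert checkpoint the student should reach
    All three are flat sequences of floats with the same length.
    """
    # Squared distance between where the student ended and the expert target.
    mismatch = sum((s - t) ** 2 for s, t in zip(student_end, expert_target))
    # Squared distance the expert travelled; used to normalize the scale.
    expert_travel = sum((a - t) ** 2 for a, t in zip(expert_start, expert_target))
    if expert_travel == 0:
        raise ValueError("expert_start and expert_target must differ")
    return mismatch / expert_travel


# Toy checkpoints (made-up numbers, not from any real training run).
expert_start = [0.0, 0.0]
expert_target = [1.0, 2.0]

perfect = trajectory_matching_loss([1.0, 2.0], expert_start, expert_target)
no_progress = trajectory_matching_loss([0.0, 0.0], expert_start, expert_target)
```

In this sketch a loss of 0 means the student landed exactly on the expert checkpoint, and a loss of 1 means it made no progress from the starting point. In an actual distillation loop the loss would be differentiated with respect to the synthetic images, which this sketch omits.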
