
gszfwsb/NCFM

PyTorch implementation of a CVPR 2025 paper that distills large image datasets into compact synthetic datasets using neural characteristic function matching.

413 stars · Python · Data Tooling · Computer Vision
Velocity (7d): +0.9 ★ / day
Trend: steady

This repository provides the official implementation of NCFM (Neural Characteristic Function Matching), a dataset distillation method published at CVPR 2025. The method synthesizes a small distilled dataset that can stand in for the full dataset when training models, reducing the amount of training data needed while aiming to preserve model performance. It uses neural characteristic functions and minmax optimization to match the statistical properties of the synthetic data to those of the original dataset, and it is applicable to standard benchmarks such as CIFAR-10/100 and ImageNet.
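To make the idea of matching characteristic functions concrete, here is a minimal sketch in plain Python. It is not taken from the repository and does not reproduce its API. It assumes samples are small feature vectors, and it draws frequency vectors from a fixed Gaussian, whereas the method described above optimizes this part in a minmax fashion. The names `empirical_cf`, `cf_discrepancy`, and `sample_frequencies` are hypothetical, chosen only for illustration.

```python
import cmath
import random


def empirical_cf(samples, t):
    """Empirical characteristic function: mean of exp(i * <t, x>) over samples."""
    total = 0j
    for x in samples:
        dot = sum(ti * xi for ti, xi in zip(t, x))
        total += cmath.exp(1j * dot)
    return total / len(samples)


def cf_discrepancy(real, synthetic, frequencies):
    """Mean squared gap between the two empirical characteristic functions."""
    gaps = [
        abs(empirical_cf(real, t) - empirical_cf(synthetic, t)) ** 2
        for t in frequencies
    ]
    return sum(gaps) / len(gaps)


def sample_frequencies(num, dim, rng, scale=1.0):
    """Hypothetical helper: draw frequency vectors from an isotropic Gaussian.

    Assumption for this sketch only; a learned sampler would replace this.
    """
    return [[rng.gauss(0.0, scale) for _ in range(dim)] for _ in range(num)]


rng = random.Random(0)

# Toy 2-D "real" data and two small candidate synthetic sets.
real = [[rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)] for _ in range(500)]
matched = [[rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)] for _ in range(50)]
shifted = [[rng.gauss(3.0, 1.0), rng.gauss(3.0, 1.0)] for _ in range(50)]

freqs = sample_frequencies(64, 2, rng)

loss_matched = cf_discrepancy(real, matched, freqs)
loss_shifted = cf_discrepancy(real, shifted, freqs)
print(f"matched: {loss_matched:.4f}  shifted: {loss_shifted:.4f}")
```

In this toy setup the synthetic set drawn from the same distribution as the real data yields a smaller discrepancy than the shifted one. A distillation loop would treat the synthetic samples as trainable parameters and reduce such a discrepancy by gradient descent, which this sketch leaves out.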
