gszfwsb/NCFM
PyTorch implementation of a CVPR 2025 paper that distills large image datasets into compact synthetic datasets using neural characteristic function matching.

This repository provides the official implementation of NCFM (Neural Characteristic Function Matching), a dataset distillation method published at CVPR 2025. The method synthesizes small distilled datasets that can effectively train models, significantly reducing data requirements while maintaining performance. It uses neural characteristic functions and minmax optimization to match the statistical properties of the original dataset, applicable to standard benchmarks like CIFAR-10/100 and ImageNet.