← all repositories

yandex-research/tab-ddpm

A diffusion model implementation for generating synthetic tabular datasets.

553 stars Python Data ToolingML Frameworks
tab-ddpm
Velocity · 7d
+0.4
★ / day
Trend
steady
star history

TabDDPM implements a denoising diffusion probabilistic model specifically designed for tabular data synthesis. The approach trains a neural network to reverse a gradual noising process, enabling high-quality generation of synthetic tabular datasets. The implementation includes training pipelines, hyperparameter tuning scripts, and evaluation tools for comparing synthetic data quality against real data.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.