← all repositories

hitsz-ids/synthetic-data-generator

A framework that uses GANs and deep learning to generate high-quality synthetic tabular data.

2.4k stars Python Data Tooling
synthetic-data-generator
Velocity · 7d
+2.3
★ / day
Trend
steady
star history

SDG is a specialized framework designed to generate high-quality structured tabular data using deep learning techniques, specifically GANs. It addresses privacy concerns by producing synthetic datasets that can substitute real data for ML training. The framework is part of an academic research initiative (HITSZ-IDS) and includes pre-commit hooks and CI testing infrastructure.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.