HazyResearch/data-centric-ai

A curated collection of resources, papers, and guides about data-centric AI methodologies from Stanford's HazyResearch group.

1.1k stars · TeX · Learning · Data Tooling
data-centric-ai
Velocity (7d): +0.6 ★ / day · Trend: steady

This repository consolidates resources on data-centric AI, an approach that prioritizes improving data quality over changing model architecture. It collects papers, blog posts, and research updates on techniques such as data labeling, curation, and validation that help ML practitioners achieve better real-world results. The project originated in Stanford’s HazyResearch group and includes contributions from the broader data-centric AI community.
