← all repositories

several27/FakeNewsCorpus

A labeled corpus of millions of news articles for fake news detection training.

413 stars Data Tooling
FakeNewsCorpus
Velocity · 7d
+0.1
★ / day
Trend
steady
star history

This repository hosts a large-scale news article dataset scraped from over 1000 domains, curated for fake news recognition tasks. Articles are labeled by source type (fake, bias, reliable, etc.) and include extracted metadata such as title, authors, content, and keywords. The corpus is formatted as CSV and intended for training NLP and deep learning classification models.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.