several27/FakeNewsCorpus
A labeled corpus of millions of news articles for fake news detection training.
★413 stars Data Tooling

Velocity · 7d
+0.1
★ / day
Trend
→steady
star history
This repository hosts a large-scale news article dataset scraped from over 1000 domains, curated for fake news recognition tasks. Articles are labeled by source type (fake, bias, reliable, etc.) and include extracted metadata such as title, authors, content, and keywords. The corpus is formatted as CSV and intended for training NLP and deep learning classification models.