← all repositories

ymcui/cmrc2018

A Chinese span-extraction reading comprehension dataset for training and evaluating question-answering models.

454 stars Python Data Tooling
cmrc2018
Velocity · 7d
+0.1
★ / day
Trend
steady
star history

CMRC 2018 is a benchmark dataset released at EMNLP-IJCNLP 2019 for Chinese machine reading comprehension, specifically designed for span-extraction question answering tasks. The repository provides training, dev, and test data along with submission guidelines through CodaLab and a Hugging Face datasets integration. It includes a public leaderboard tracking state-of-the-art systems on this benchmark.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.