CLUEbenchmark/CLUE
Chinese Language Understanding Evaluation benchmark providing datasets, baselines, pre-trained models, and leaderboard for evaluating NLP models.

CLUE is a comprehensive Chinese NLP benchmark that provides datasets, pre-trained models, baselines, and leaderboards for evaluating language models on various NLU tasks. The repository includes popular models like BERT, RoBERTa, ALBERT adapted for Chinese, along with evaluation datasets covering classification, inference, and other NLP tasks. It supports multiple deep learning frameworks including PyTorch, TensorFlow, and PaddlePaddle.