brightmart/roberta_zh
A Chinese RoBERTa pre-trained language model implementation in TensorFlow and PyTorch.

Velocity · 7d
+1.1
★ / day
Trend
→steady
star history
This repository provides pre-trained RoBERTa models for Chinese language processing. It includes implementations for both TensorFlow and PyTorch frameworks. The models were trained on approximately 30GB of Chinese text data comprising nearly 300 million sentences and 10 billion Chinese tokens. Available model variants include 6-layer and 24/12-layer versions, compatible with standard Bert loading mechanisms.