brightmart/roberta_zh

A Chinese RoBERTa pre-trained language model implementation in TensorFlow and PyTorch.

2.8k stars · Python · Language Models
Star velocity (7d): +1.1 ★ / day · Trend: steady

This repository provides pre-trained RoBERTa models for Chinese language processing, with implementations for both TensorFlow and PyTorch. The models were trained on approximately 30 GB of Chinese text comprising nearly 300 million sentences and 10 billion Chinese tokens. Available variants include 6-layer, 12-layer, and 24-layer models, and the checkpoints can be loaded with standard BERT loading code.
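Because the checkpoints are described as loadable through standard BERT mechanisms, a minimal PyTorch loading sketch might look like the following. It assumes the Hugging Face `transformers` library and a locally downloaded, already converted checkpoint directory containing `config.json` and `vocab.txt`. The helper names `missing_files` and `load_roberta_zh`, and the file layout itself, are illustrative assumptions and are not taken from the repository.

```python
from pathlib import Path

# Assumption: a converted PyTorch checkpoint directory holds at least these
# two files. The actual release layout may differ.
EXPECTED_FILES = ("config.json", "vocab.txt")


def missing_files(model_dir):
    """Return the expected file names that are absent from model_dir."""
    root = Path(model_dir)
    return [name for name in EXPECTED_FILES if not (root / name).is_file()]


def load_roberta_zh(model_dir):
    """Load tokenizer and encoder with the BERT classes (hypothetical helper).

    The BERT classes are used on purpose: the page states that the
    checkpoints follow the standard BERT loading path.
    """
    missing = missing_files(model_dir)
    if missing:
        raise FileNotFoundError(f"{model_dir} is missing: {', '.join(missing)}")

    # Imported lazily so the directory check works without transformers installed.
    from transformers import BertModel, BertTokenizer

    tokenizer = BertTokenizer.from_pretrained(model_dir)
    model = BertModel.from_pretrained(model_dir)
    model.eval()  # inference mode: disables dropout
    return tokenizer, model
```

With a checkpoint unpacked at a path of your choosing, `tokenizer, model = load_roberta_zh(path)` followed by `model(**tokenizer("今天天气很好", return_tensors="pt"))` should return the encoder outputs, provided the directory really matches the assumed layout.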
