Morizeyao/GPT2-Chinese
Chinese GPT-2 implementation for training and generating text in Chinese using BERT tokenizer or BPE tokenization

Velocity · 7d
+3.0
★ / day
Trend
→steady
star history
Provides training code for Chinese GPT-2 language models. Supports multiple tokenization approaches including BERT tokenizer, BPE, character-level and word-level tokenization. Includes pre-trained models for general Chinese text, lyrics, and classical Chinese writing. Built on HuggingFace Transformers library and PyTorch.