
Morizeyao/GPT2-Chinese

Chinese GPT-2 implementation for training on and generating Chinese text, using either a BERT tokenizer or BPE tokenization

7.6k stars · Python · Language Models
Velocity (7d): +3.0 ★ / day · Trend: steady

Provides training code for Chinese GPT-2 language models. Supports several tokenization approaches: a BERT tokenizer, BPE, character-level tokenization, and word-level tokenization. Includes pre-trained models for general Chinese text, lyrics, and classical Chinese writing. Built on the HuggingFace Transformers library and PyTorch.
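
To make the difference between character-level and word-level tokenization concrete, here is a minimal sketch in plain Python. It is not code from the repository: the function names (`char_tokenize`, `word_tokenize`, `build_vocab`, `encode`) and the `[PAD]`/`[UNK]` special tokens are hypothetical, and the word-level variant assumes the text has already been segmented into space-separated words by some external tool.

```python
def char_tokenize(text):
    """Character-level: every non-whitespace character is its own token."""
    return [ch for ch in text if not ch.isspace()]


def word_tokenize(segmented_text):
    """Word-level: assumes the input is already segmented by spaces."""
    return segmented_text.split()


def build_vocab(token_lists, specials=("[PAD]", "[UNK]")):
    """Assign an integer id to each distinct token, special tokens first."""
    vocab = {tok: i for i, tok in enumerate(specials)}
    for tokens in token_lists:
        for tok in tokens:
            if tok not in vocab:
                vocab[tok] = len(vocab)
    return vocab


def encode(tokens, vocab):
    """Map tokens to ids, falling back to [UNK] for unseen tokens."""
    unk = vocab["[UNK]"]
    return [vocab.get(tok, unk) for tok in tokens]


# Hypothetical pre-segmented sentence: "I like natural language processing".
sentence = "我 喜欢 自然 语言 处理"
char_tokens = char_tokenize(sentence)   # one token per character
word_tokens = word_tokenize(sentence)   # one token per segmented word

char_vocab = build_vocab([char_tokens])
word_vocab = build_vocab([word_tokens])
char_ids = encode(char_tokens, char_vocab)
word_ids = encode(word_tokens, word_vocab)
```

Character-level tokenization keeps the vocabulary small and needs no segmenter, at the cost of longer sequences. Word-level tokenization produces shorter sequences but depends on segmentation quality and leaves more tokens out of vocabulary.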
