imcaspar/gpt2-ml
A multilingual GPT-2 implementation providing 1.5B parameter Chinese pretrained models with TPU training support.

Velocity · 7d
+0.7
★ / day
Trend
→steady
star history
This repository provides GPT-2 adapted for multiple languages, with a focus on Chinese. It includes 1.5 billion parameter pretrained Chinese language models trained on large corpora (~15G and ~30G text), along with training scripts supporting TPU execution based on Grover, and ported BERT tokenizers for multilingual compatibility. The project offers ready-to-use Colab demos for model inference.