← all repositories

imcaspar/gpt2-ml

A multilingual GPT-2 implementation providing 1.5B parameter Chinese pretrained models with TPU training support.

1.7k stars Python Language Models
gpt2-ml
Velocity · 7d
+0.7
★ / day
Trend
steady
star history

This repository provides GPT-2 adapted for multiple languages, with a focus on Chinese. It includes 1.5 billion parameter pretrained Chinese language models trained on large corpora (~15G and ~30G text), along with training scripts supporting TPU execution based on Grover, and ported BERT tokenizers for multilingual compatibility. The project offers ready-to-use Colab demos for model inference.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.