yandex/YaLM-100B
Yandex's 100-billion parameter GPT-like pretrained language model for text generation and processing.

Velocity · 7d
+2.6
★ / day
Trend
→steady
star history
YaLM 100B is a large-scale pretrained language model trained on 1.7TB of online texts, books, and other sources in English and Russian. The repository provides inference code based on DeepSpeed ZeRO-3 and supports multi-GPU tensor parallelism. It includes Docker images and scripts for downloading model weights and vocabulary, enabling deployment on GPU clusters.