← all repositories

yandex/YaLM-100B

Yandex's 100-billion parameter GPT-like pretrained language model for text generation and processing.

3.8k stars Python Language Models
YaLM-100B
Velocity · 7d
+2.6
★ / day
Trend
steady
star history

YaLM 100B is a large-scale pretrained language model trained on 1.7TB of online texts, books, and other sources in English and Russian. The repository provides inference code based on DeepSpeed ZeRO-3 and supports multi-GPU tensor parallelism. It includes Docker images and scripts for downloading model weights and vocabulary, enabling deployment on GPU clusters.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.