
bigscience-workshop/xmtf

Multitask finetuning framework for creating the multilingual instruction-tuned models BLOOMZ and mT0 from BLOOM and mT5 checkpoints.

536 stars · Jupyter Notebook · Language Models · Data Tooling
Velocity (7d): +0.4 ★/day · Trend: steady

This repository accompanies the paper "Crosslingual Generalization through Multitask Finetuning" and provides all components used to create the BLOOMZ and mT0 model families. It includes the xP3 multilingual dataset for multitask training, training scripts for finetuning mT5-based and BLOOM-based models on diverse tasks across 46+ languages, and evaluation pipelines for assessing crosslingual generalization. Through instruction tuning on this multitask mixture, the resulting models transfer zero-shot to tasks in non-English languages.
