← all repositories

XueFuzhao/OpenMoE

OpenMoE releases open-sourced Mixture-of-Experts Large Language Models, including an 8B base model and chat model.

1.7k stars Python Language Models
OpenMoE
Velocity · 7d
+1.6
★ / day
Trend
steady
star history

OpenMoE is a family of open-sourced Mixture-of-Experts (MoE) Large Language Models. The project releases model weights, training data, strategies, and model architecture to the community. It includes OpenMoE-base and 8B variants, with explorations toward 34B scale models. The team provides training code and checkpoints along with a research paper analyzing routing behavior in MoE architectures.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.