rednote-hilab/dots.llm1
A 142B-parameter MoE language model with 14B activated parameters released as base and instruct variants by rednote-hilab.
★490 stars Language Models

Velocity · 7d
+1.3
★ / day
Trend
→steady
star history
dots.llm1 is a large-scale Mixture of Experts language model achieving performance comparable to Qwen2.5-72B through pretraining on high-quality corpus without synthetic data. The repository provides access to both base and instruction-tuned model variants along with intermediate training checkpoints spanning the full training process to support research into large language model learning dynamics.