
rhymes-ai/Aria

A multimodal-native MoE language model with 25.3B total parameters and a 64K-token context window for vision-language understanding.

1.1k stars · Jupyter Notebook · Language Models · Inference · Serving
Velocity (7d): +1.8 ★/day · Trend: steady

Aria is an open, multimodal-native mixture-of-experts (MoE) model that activates 3.9B parameters per token and has a 64K-token context window. The project reports state-of-the-art performance across language and multimodal tasks, with particular strength in video and document understanding. The codebase supports inference and fine-tuning through its Hugging Face Transformers integration, and the model can be deployed on a single A100 GPU in bfloat16 precision.
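
As a rough check on the single-GPU claim, the sketch below estimates how much memory the weights alone need in bfloat16. The parameter counts come from the description above; the 80 GB A100 capacity is an assumption (the listing does not say which A100 variant is meant), and the estimate ignores activations and the KV cache, which grow with context length.

```python
# Back-of-envelope weight-memory estimate (weights only, no activations or KV cache).
TOTAL_PARAMS = 25.3e9    # total parameters; all experts must be resident in memory
ACTIVE_PARAMS = 3.9e9    # parameters activated per token
BYTES_PER_BF16 = 2       # bfloat16 stores each value in 16 bits
A100_GB = 80             # ASSUMPTION: 80 GB A100 variant, not stated in the listing


def weight_gb(params: float, bytes_per_param: int = BYTES_PER_BF16) -> float:
    """Return weight storage in decimal gigabytes (1 GB = 1e9 bytes)."""
    return params * bytes_per_param / 1e9


total_gb = weight_gb(TOTAL_PARAMS)
active_fraction = ACTIVE_PARAMS / TOTAL_PARAMS
headroom_gb = A100_GB - total_gb

print(f"bf16 weights: {total_gb:.1f} GB")
print(f"active per token: {active_fraction:.1%}")
print(f"headroom on an {A100_GB} GB card: {headroom_gb:.1f} GB")
```

Under these assumptions the weights take about 50.6 GB, so they would not fit on a 40 GB card, and only about 15% of the parameters take part in any one token's forward pass. MoE sparsity lowers compute per token, not the memory needed to hold the model.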
