EvolvingLMMs-Lab/lmms-engine
A unified training engine for multimodal large language models supporting video generation and vision-language tasks.

Velocity (7d): +2.5 ★/day · Trend: steady
lmms-engine is a lean, flexible framework for training unified multimodal models at scale. It supports architectures that combine language, vision, and video modalities, and it reports MFU (Model FLOPs Utilization) metrics for benchmarking model architectures and training configurations.
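
As a rough illustration of what an MFU figure measures, the sketch below uses the common approximation of 6 FLOPs per parameter per token for dense transformer training (attention FLOPs are ignored). It is not taken from lmms-engine and may differ from how the engine computes MFU. The function name `estimate_mfu`, its parameters, and the example numbers are all hypothetical.

```python
def estimate_mfu(num_params: float,
                 tokens_per_second: float,
                 peak_flops_per_device: float,
                 num_devices: int = 1) -> float:
    """Estimate Model FLOPs Utilization as achieved FLOPs/s over peak FLOPs/s.

    Assumes a dense transformer, where one training step costs roughly
    6 FLOPs per parameter per token (2 for the forward pass, 4 for the
    backward pass). Attention FLOPs are not counted in this approximation.
    """
    if peak_flops_per_device <= 0 or num_devices <= 0:
        raise ValueError("peak_flops_per_device and num_devices must be positive")

    # FLOPs the model actually needs per second at the observed throughput.
    achieved_flops_per_second = 6.0 * num_params * tokens_per_second

    # Theoretical ceiling of the hardware across all devices.
    peak_flops_per_second = peak_flops_per_device * num_devices

    return achieved_flops_per_second / peak_flops_per_second


if __name__ == "__main__":
    # Hypothetical run: a 7B-parameter model at 20,000 tokens/s (global)
    # on 8 devices, each with an assumed peak of 312 TFLOP/s.
    mfu = estimate_mfu(
        num_params=7e9,
        tokens_per_second=2e4,
        peak_flops_per_device=312e12,
        num_devices=8,
    )
    print(f"Estimated MFU: {mfu:.1%}")
```

Because MFU normalizes by hardware peak, it allows throughput to be compared across model sizes and device counts, which is why it is useful for benchmarking different architectures and configurations.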