Alpha-VLLM/Lumina-T2X
A unified text-to-any-modality generation framework using flow-based large diffusion transformers.

Velocity · 7d
+2.8
★ / day
Trend
→steady
star history
Lumina-T2X is a generative AI framework that transforms text descriptions into images, videos, audio, and 3D content using flow-matching diffusion transformers. It supports variable resolution and duration generation across modalities. The project includes model weights, training code, and inference pipelines. It builds on the VLLM ecosystem for efficient inference.