← all repositories

Alpha-VLLM/Lumina-T2X

A unified text-to-any-modality generation framework using flow-based large diffusion transformers.

2.3k stars Python Image · Video · Audio
Lumina-T2X
Velocity · 7d
+2.8
★ / day
Trend
steady
star history

Lumina-T2X is a generative AI framework that transforms text descriptions into images, videos, audio, and 3D content using flow-matching diffusion transformers. It supports variable resolution and duration generation across modalities. The project includes model weights, training code, and inference pipelines. It builds on the VLLM ecosystem for efficient inference.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.