showlab/Show-1
A text-to-video generation model that combines pixel-level and latent diffusion architectures for synthesizing videos from text prompts.

Velocity (7d): +1.2 ★/day · Trend: steady
Show-1 is a research-grade text-to-video generation system that combines pixel-based and latent diffusion models to produce coherent videos from text prompts. Running diffusion in both pixel space and latent space is intended to balance output quality against computational cost. The project is released with code and model weights, so researchers can reproduce its video synthesis results and build on the framework.