← all repositories

alibaba/Tora

A trajectory-oriented Diffusion Transformer model for high-quality text-to-video and image-to-video generation.

1.2k stars Python Image · Video · Audio
Tora
Velocity · 7d
+2.1
★ / day
Trend
steady
star history

Tora is a video generation model based on Diffusion Transformer architecture, published at CVPR 2025. It enables text-to-video and image-to-video generation with trajectory control mechanisms. The model is available on both ModelScope and HuggingFace with implementations supporting SAT and Diffusers frameworks.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.