alibaba/Tora
A trajectory-oriented Diffusion Transformer model for high-quality text-to-video and image-to-video generation.

Velocity (7d): +2.1 ★ / day · Trend: steady
Tora is a video generation model built on the Diffusion Transformer (DiT) architecture and published at CVPR 2025. It supports text-to-video and image-to-video generation, with trajectory control over the motion in the generated video. Model weights are available on both ModelScope and HuggingFace, and implementations are provided for the SAT and Diffusers frameworks.
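To make the idea of trajectory control more concrete, here is a minimal sketch of how a user-drawn path could be turned into one position per video frame. It is an illustration only: the function name `interpolate_trajectory`, the waypoint format, and the use of linear interpolation are all assumptions, and none of this is taken from Tora's actual API or preprocessing code.

```python
def interpolate_trajectory(waypoints, num_frames):
    """Linearly interpolate sparse (x, y) waypoints into one point per frame.

    Hypothetical helper for illustration; not part of the Tora codebase.
    """
    if not waypoints:
        raise ValueError("waypoints must contain at least one point")
    if num_frames < 1:
        raise ValueError("num_frames must be at least 1")
    # A single waypoint (or a single frame) means the point does not move.
    if len(waypoints) == 1 or num_frames == 1:
        return [tuple(waypoints[0])] * num_frames

    segments = len(waypoints) - 1
    points = []
    for frame in range(num_frames):
        # Position along the whole path, in the range [0, segments].
        t = frame * segments / (num_frames - 1)
        # Index of the segment this frame falls in (clamped for the last frame).
        i = min(int(t), segments - 1)
        frac = t - i
        (x0, y0), (x1, y1) = waypoints[i], waypoints[i + 1]
        points.append((x0 + frac * (x1 - x0), y0 + frac * (y1 - y0)))
    return points


if __name__ == "__main__":
    # An L-shaped path: move right, then down, sampled over 5 frames.
    for point in interpolate_trajectory([(0, 0), (10, 0), (10, 10)], 5):
        print(point)
```

In this sketch each segment between waypoints receives an equal share of the frames, so the apparent speed varies with segment length. A real pipeline might instead resample by arc length or smooth the path before conditioning the model on it.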