← all repositories

menyifang/MIMO

A diffusion-based video synthesis model that generates controllable character videos from images and motion inputs.

1.6k stars Python Image · Video · Audio
MIMO
Velocity · 7d
+2.5
★ / day
Trend
steady
star history

MIMO is a generalizable model for controllable video synthesis that generates realistic character videos with controllable attributes including character identity, motion, and scene composition. It achieves scalability to arbitrary characters, generality to novel 3D motions, and applicability to interactive real-world scenes using spatial decomposed modeling within a diffusion model framework.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.