caiyuanhao1998/Open-OmniVCus
A diffusion Transformer model for subject-driven video customization under multimodal control conditions, published at NeurIPS 2025.

Velocity · 7d
+1.5
★ / day
Trend
→steady
star history
OmniVCus is a video generation system that personalizes videos based on subject images while accepting multiple control conditions such as text prompts and reference poses. The implementation uses diffusion models and transformer architectures to enable feedforward video customization, trained on custom datasets and released with model weights on Hugging Face.