← all repositories

nv-tlabs/GEN3C

A generative video model that uses 3D scene information (depth and point clouds) to produce temporally consistent videos with precise camera control.

1.4k stars Jupyter Notebook Image · Video · Audio
GEN3C
Velocity · 7d
+2.9
★ / day
Trend
steady
star history

GEN3C is a video diffusion model that conditions generation on a 3D cache of point clouds derived from depth predictions. When generating new frames, the model renders the 3D cache from user-specified camera trajectories, ensuring world consistency without the model needing to remember prior frames. This approach addresses common video generation issues like objects appearing or disappearing inconsistently, and provides precise camera control rather than relying on the network to infer camera effects.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.