← all repositories

williamyang1991/Rerender_A_Video

A zero-shot text-guided video-to-video translation system using adapted diffusion models with hierarchical cross-frame constraints.

3k stars Jupyter Notebook Image · Video · AudioInference · Serving
Rerender_A_Video
Velocity · 7d
+2.7
★ / day
Trend
steady
star history

Rerender A Video adapts large text-to-image diffusion models for video domain translation while maintaining temporal consistency. The framework first translates key frames using an adapted diffusion model with cross-frame constraints for shape, texture, and color coherence, then propagates results to full videos via temporal-aware patch matching and frame blending. Implemented in PyTorch, it enables style transfer and content manipulation on videos using natural language prompts without training on target videos.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.