← all repositories

UCSC-VLAA/story-iter

A training-free iterative diffusion model framework for generating coherent image sequences from long narrative text.

958 stars Python Image · Video · Audio
story-iter
Velocity · 7d
+1.4
★ / day
Trend
steady
star history

Story-Iter is a research implementation for long story visualization using diffusion models. It introduces an external iterative paradigm that refines each generated image by incorporating reference images from previous rounds, addressing semantic consistency and fine-grained interaction challenges in multi-frame visual storytelling. The framework proposes a plug-and-play global reference cross-attention mechanism that operates without additional training.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.