HKUDS/ViMax

An AI film crew that actually shows up to work

ViMax orchestrates director, screenwriter, and producer agents to generate multi-shot videos from raw ideas, novels, or scripts.

Velocity · 7d: +21 ★ / day · Trend: steady

What it does

ViMax is a multi-agent framework that tries to automate the entire video pipeline: you feed it a concept, novel, or screenplay, and it handles script generation, storyboarding, character design, shot planning, and final video output. It targets the usual AI video pain points—short clips, inconsistent characters, and visual-only output—by adding narrative structure and audio integration.

The interesting bit

The framework explicitly breaks the work into film-production roles (Director, Screenwriter, Producer, Video Generator) rather than treating generation as a single prompt-to-pixels step. There’s also an AutoCameo mode that ingests your photo and inserts you as a recurring character—useful for personalized content, slightly unsettling for everything else.
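To make the role split concrete, here is a minimal sketch of a pipeline in which each film-production role is a separate step over shared state. It is not ViMax's actual code: the `Production` dataclass, the function names, and the string-splitting stand-ins for model calls are all hypothetical, chosen only to show the shape of role-based orchestration versus a single prompt-to-pixels call.

```python
from dataclasses import dataclass, field


@dataclass
class Production:
    """Shared state handed from one role to the next (hypothetical structure)."""
    idea: str
    script: list[str] = field(default_factory=list)  # one entry per scene
    shots: list[dict] = field(default_factory=list)  # planned shots
    clips: list[str] = field(default_factory=list)   # placeholder clip names


def screenwriter(p: Production) -> Production:
    # Stand-in for an LLM call: treat each sentence of the idea as a scene.
    p.script = [s.strip() for s in p.idea.split(".") if s.strip()]
    return p


def director(p: Production) -> Production:
    # Stand-in for shot planning: one wide and one close shot per scene.
    p.shots = [
        {"scene": i, "framing": framing, "text": scene}
        for i, scene in enumerate(p.script)
        for framing in ("wide", "close")
    ]
    return p


def video_generator(p: Production) -> Production:
    # Stand-in for a video model: emit a file name instead of pixels.
    p.clips = [f"scene{s['scene']}_{s['framing']}.mp4" for s in p.shots]
    return p


def producer(idea: str) -> Production:
    # The producer sequences the other roles over the shared state.
    p = Production(idea=idea)
    for role in (screenwriter, director, video_generator):
        p = role(p)
    return p


result = producer("A robot finds a cat. They become friends.")
print(result.clips)
```

Because every role reads and writes the same state object, a later step such as a consistency check could inspect `shots` before any clip is generated, which is the kind of hook a single-prompt approach does not offer.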

Key highlights

  • Four input modes: Idea2Video, Novel2Video, Script2Video, and AutoCameo (photo-based guest star insertion)
  • RAG-based long-script engine for segmenting novels into multi-scene formats
  • Claims automated consistency checking for characters and environments across shots
  • Audio/video binding with character voice and sound effects
  • Python 3.12, uv-ready, MIT licensed
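
The README does not document how the long-script engine works, so the following is only a toy illustration of the general retrieve-then-generate idea behind segmenting a novel: split the text into chunks, then pull the chunk most relevant to a scene description. The `chunk` and `retrieve` helpers are hypothetical, and plain word overlap stands in for the embedding search a real RAG system would likely use.

```python
import re
from collections import Counter


def chunk(text: str, max_words: int = 40) -> list[str]:
    """Group paragraphs into chunks, starting a new one when max_words would be exceeded."""
    chunks, current, count = [], [], 0
    for para in (p.strip() for p in text.split("\n\n")):
        if not para:
            continue
        n = len(para.split())
        if current and count + n > max_words:
            chunks.append(" ".join(current))
            current, count = [], 0
        current.append(para)
        count += n
    if current:
        chunks.append(" ".join(current))
    return chunks


def tokens(s: str) -> Counter:
    # Lowercased word counts; a crude substitute for an embedding.
    return Counter(re.findall(r"[a-z']+", s.lower()))


def retrieve(query: str, chunks: list[str], k: int = 1) -> list[str]:
    """Rank chunks by word overlap with the query and return the top k."""
    q = tokens(query)
    ranked = sorted(chunks, key=lambda c: sum((tokens(c) & q).values()), reverse=True)
    return ranked[:k]


novel = (
    "Mara walked to the harbor at dawn.\n\n"
    "The lighthouse keeper waved from the tower.\n\n"
    "That night a storm broke over the harbor."
)
parts = chunk(novel, max_words=8)
print(retrieve("storm at night", parts))
```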

Caveats

  • README is heavy on vision and light on implementation details; the architecture section is truncated
  • “Agents Loop” and the technical report are listed as “Coming Soon” (not shipped)
  • Actual model backends, compute requirements, and cost are unspecified

Verdict

Worth watching if you’re building agentic media pipelines or need long-form video generation with narrative structure. Skip if you need something production-ready today—the framework appears to be early-stage and the heavy marketing prose outpaces the documented engineering.
