AI short drama factory: from prompt to TikTok-ready video
A TypeScript stack that automates scriptwriting, storyboarding, and video synthesis for the short-drama gold rush.

What it does
Huobao Drama is a full-stack TypeScript platform that turns a one-line prompt into a complete short drama. It chains LLMs for script rewriting and scene extraction, image models for character design, and video models for shot generation, then stitches everything together with FFmpeg and TTS into a final episode.
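To make the final stitching step concrete, here is a minimal sketch of how generated shot clips could be joined with FFmpeg's concat demuxer. The project's actual FFmpeg invocation is not shown in this write-up, so the helper names (`concatListEntry`, `buildConcatList`, `buildConcatArgs`) and file names are hypothetical; only the `-f concat -safe 0 -i ... -c copy` flags are standard FFmpeg.

```typescript
// Hypothetical helpers: a sketch of concat-demuxer stitching, not Huobao Drama's code.

/** One line of a concat list file: the path in single quotes, embedded quotes escaped. */
function concatListEntry(path: string): string {
  return `file '${path.replace(/'/g, "'\\''")}'`;
}

/** Full contents of the list file, one clip per line, in playback order. */
function buildConcatList(clips: string[]): string {
  return clips.map(concatListEntry).join("\n") + "\n";
}

/** Arguments for `ffmpeg`; stream copy (-c copy) assumes all clips share codec and resolution. */
function buildConcatArgs(listFile: string, output: string): string[] {
  return ["-y", "-f", "concat", "-safe", "0", "-i", listFile, "-c", "copy", output];
}

const listText = buildConcatList(["shot_001.mp4", "shot_002.mp4"]);
const ffmpegArgs = buildConcatArgs("shots.txt", "episode.mp4");
console.log(listText);
console.log(["ffmpeg", ...ffmpegArgs].join(" "));
```

Stream copy avoids re-encoding, which keeps stitching cheap; if the video provider returns clips with differing codecs or frame sizes, a re-encode would be needed instead.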
The interesting bit
The backend runs five specialized Mastra AI agents (script rewriter, extractor, storyboard breaker, voice assigner, and prompt generator) that hand off work like an actual film crew. The frontend is pure CSS, no UI framework, which is either admirably disciplined or stubbornly minimal depending on your taste.
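The hand-off pattern can be sketched as a chain of async stages that each enrich a shared episode state. This is a plain TypeScript illustration and deliberately does not use the Mastra API; the `EpisodeState` shape, the stage names, and the stub logic standing in for model calls are all assumptions.

```typescript
// Hypothetical types and stubs: illustrates sequential hand-off, not the project's agents.
interface EpisodeState {
  prompt: string;
  script?: string;
  scenes?: string[];
  shots?: string[];
  voices?: Record<string, string>;
  videoPrompts?: string[];
  log: string[]; // names of the stages that have run, in order
}

interface Stage {
  name: string;
  run: (state: EpisodeState) => Promise<EpisodeState>;
}

// Each stub stands in for an LLM call and only reads fields set by earlier stages.
const stages: Stage[] = [
  { name: "scriptRewriter", run: async (s) => ({ ...s, script: s.prompt.trim() }) },
  {
    name: "extractor",
    run: async (s) => ({
      ...s,
      scenes: (s.script ?? "").split(";").map((x) => x.trim()).filter((x) => x.length > 0),
    }),
  },
  {
    name: "storyboardBreaker",
    run: async (s) => ({
      ...s,
      shots: (s.scenes ?? []).flatMap((scene) => [`${scene} / wide`, `${scene} / close-up`]),
    }),
  },
  { name: "voiceAssigner", run: async (s) => ({ ...s, voices: { narrator: "voice-1" } }) },
  {
    name: "promptGenerator",
    run: async (s) => ({
      ...s,
      videoPrompts: (s.shots ?? []).map((shot) => `vertical video, cinematic: ${shot}`),
    }),
  },
];

/** Run the stages in order, passing each stage's output to the next. */
async function runPipeline(all: Stage[], initial: EpisodeState): Promise<EpisodeState> {
  let state = initial;
  for (const stage of all) {
    state = await stage.run(state);
    state = { ...state, log: [...state.log, stage.name] };
  }
  return state;
}

runPipeline(stages, { prompt: "a meets b; b leaves", log: [] }).then((out) => {
  console.log(out.log.join(" -> "));
  console.log(out.videoPrompts);
});
```

Keeping every stage a pure function of the accumulated state makes it easy to rerun a single step, such as regenerating video prompts, without redoing the script.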
Key highlights
- Supports multiple Chinese and international providers: OpenAI, Gemini, MiniMax, Volcano Engine, Alibaba, Vidu, Seedance
- Docker image bundles FFmpeg and merges frontend/backend into a single port
- SQLite with WAL mode; tables auto-create on first boot
- “Grid image” generation splits composite character sheets for reuse across shots
- Runtime skill loading via skills/SKILL.md files
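The "grid image" highlight boils down to cropping one composite sheet into equal cells. Below is a minimal sketch of the crop-rectangle arithmetic, assuming a regular rows-by-columns layout; the `gridCells` name and the layout assumption are mine, not the project's. The `left/top/width/height` shape mirrors what image libraries such as sharp accept as a crop region.

```typescript
// Hypothetical helper: computes crop regions for a regular grid, no image I/O involved.
interface Rect {
  left: number;
  top: number;
  width: number;
  height: number;
}

/** Split an image of the given size into rows x cols cells, in row-major order. */
function gridCells(imageWidth: number, imageHeight: number, rows: number, cols: number): Rect[] {
  if (!Number.isInteger(rows) || !Number.isInteger(cols) || rows < 1 || cols < 1) {
    throw new RangeError("rows and cols must be positive integers");
  }
  const cells: Rect[] = [];
  for (let r = 0; r < rows; r++) {
    // Flooring the cell boundaries (not the cell size) keeps cells gap-free
    // even when the image size is not evenly divisible.
    const top = Math.floor((r * imageHeight) / rows);
    const bottom = Math.floor(((r + 1) * imageHeight) / rows);
    for (let c = 0; c < cols; c++) {
      const left = Math.floor((c * imageWidth) / cols);
      const right = Math.floor(((c + 1) * imageWidth) / cols);
      cells.push({ left, top, width: right - left, height: bottom - top });
    }
  }
  return cells;
}

// A 2x2 character sheet at 1024x1024 yields four 512x512 crops.
console.log(gridCells(1024, 1024, 2, 2));
```

Each rectangle can then be cropped out once and reused as a character reference across shots.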
Caveats
- CC BY-NC-SA 4.0 license means commercial use requires negotiation
- TTS only supports MiniMax; no alternative voice providers visible
- The cloud deployment pitch recommends a 3080 Ti, though the stack itself is CPU-friendly, since the heavy video generation runs on external APIs
Verdict
Worth a spin if you’re experimenting with AI-generated vertical video or building a short-drama pipeline. Skip it if you need broadcast-grade control over every shot: this is automation-first, finesse-second.