← all repositories

HKUDS/VideoAgent

An agentic framework using multi-modal LLMs to understand, edit, and generate video content.

743 stars Python AgentsImage · Video · Audio
VideoAgent
Velocity · 7d
+2.3
★ / day
Trend
steady
star history

VideoAgent is an all-in-one agentic framework for comprehensive video intelligence. It combines understanding, editing, and remaking capabilities through multi-modal LLM agents. The framework enables intent analysis and autonomous tool use for video tasks, allowing users to articulate requirements and generate multi-modal products including detailed workflows and video overviews.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.