HKUDS/VideoAgent
An agentic framework using multi-modal LLMs to understand, edit, and generate video content.

Velocity · 7d
+2.3
★ / day
Trend
→steady
star history
VideoAgent is an all-in-one agentic framework for comprehensive video intelligence. It combines understanding, editing, and remaking capabilities through multi-modal LLM agents. The framework enables intent analysis and autonomous tool use for video tasks, allowing users to articulate requirements and generate multi-modal products including detailed workflows and video overviews.