← all repositories

bytedance/UI-TARS-desktop

UI-TARS Desktop is a native desktop application providing a GUI agent based on the UI-TARS multimodal vision-language model for computer automation.

36.2k stars TypeScript AgentsCoding Assistants
UI-TARS-desktop
Velocity · 7d
+72
★ / day
Trend
steady
star history

The repository provides a multimodal AI agent stack comprising Agent TARS (CLI/Web UI agent) and UI-TARS Desktop (native desktop GUI agent). It leverages cutting-edge multimodal LLMs and vision models to automate computer tasks through a GUI agent that can interact with desktop interfaces. The stack integrates with MCP tools and is built on the open-source UI-TARS vision-language model.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.