TencentQQGYLab/AppAgent
A multimodal agent framework enabling LLMs to autonomously navigate and interact with smartphone applications.

Velocity (7d): +7.5 ★/day · Trend: steady
AppAgent is a multimodal agent framework that uses large language models to operate smartphone applications. Agents perceive the on-screen environment, decide on a next step, and execute actions to complete tasks in mobile apps. The system uses vision-language models to interpret GUI elements and navigates apps autonomously, without requiring custom API integrations.
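The perceive, decide, act cycle described above can be illustrated with a minimal sketch. This is not AppAgent's actual code or API: `Action`, `Observation`, `run_agent`, and `scripted_policy` are hypothetical names introduced for illustration, and the scripted policy stands in for what would be a vision-language model call in a real system.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Action:
    # Hypothetical action vocabulary for this sketch: "tap" or "finish".
    kind: str
    target: Optional[int] = None  # index of the UI element to act on


@dataclass
class Observation:
    # Labels of the UI elements currently visible (assumed representation).
    elements: List[str]


def run_agent(
    task: str,
    observe: Callable[[], Observation],
    decide: Callable[[str, Observation, List[Action]], Action],
    execute: Callable[[Action], None],
    max_steps: int = 10,
) -> List[Action]:
    """Loop: observe the screen, pick an action, execute it, until finished."""
    history: List[Action] = []
    for _ in range(max_steps):
        obs = observe()                       # perceive
        action = decide(task, obs, history)   # decide
        history.append(action)
        if action.kind == "finish":
            break
        execute(action)                       # act
    return history


def scripted_policy(task: str, obs: Observation, history: List[Action]) -> Action:
    # Stand-in for a model call: tap the first element whose label appears
    # in the task text, then finish on the following step.
    if history:
        return Action("finish")
    for i, label in enumerate(obs.elements):
        if label.lower() in task.lower():
            return Action("tap", target=i)
    return Action("finish")


if __name__ == "__main__":
    screen = Observation(elements=["Search", "Settings"])
    performed: List[Action] = []
    trace = run_agent("open settings", lambda: screen, scripted_policy, performed.append)
    print([a.kind for a in trace])
```

Passing `observe`, `decide`, and `execute` as callables keeps the loop independent of any particular device driver or model, which is one plausible way to avoid app-specific API integrations; the `max_steps` cap is an assumption added here so the loop always terminates.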