← all repositories

TencentQQGYLab/AppAgent

A multimodal agent framework enabling LLMs to autonomously navigate and interact with smartphone applications.

6.8k stars Python Agents
AppAgent
Velocity · 7d
+7.5
★ / day
Trend
steady
star history

AppAgent is a multimodal agent system designed to operate smartphone applications using large language models. It provides a framework where agents can perceive the smartphone environment, make decisions, and execute actions to complete tasks on mobile apps. The system uses vision-language models to understand GUI elements and navigates through apps autonomously without requiring custom API integrations.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.