IPADS-SAI/MobiAgent
A customizable mobile agent system for intelligent GUI interaction on smartphones using custom vision-language models.

MobiAgent is a systematic framework for building intelligent mobile agents that interact with smartphone GUIs. It has three components: MobiMind, the project's own family of vision-language models; AgentRR, an agent acceleration framework; and MobiFlow, a benchmark suite. The system supports on-device inference on smartphones and provides a unified runner for configuring and running multiple GUI agent models, including UI-TARS, AutoGLM, and Qwen-VL.