showlab/ShowUI
A 2B-parameter Vision-Language-Action model designed for GUI agents to interact with digital interfaces autonomously.

Velocity · 7d
+3.2
★ / day
Trend
→steady
star history
ShowUI is an open-source, end-to-end Vision-Language-Action model specialized for GUI agents and computer use tasks. It processes visual interface screenshots and generates actionable outputs to interact with desktop and web environments. The model is available as ShowUI-2B on HuggingFace with supporting datasets and integration with computer use frameworks.