askui/python-sdk
A Python SDK enabling AI agents to automate UI interactions across desktop, mobile, and embedded devices using vision-based element detection and multi-model LLM support.

AskUI Python SDK is an automation framework that allows AI agents to control desktop, mobile, and HMI devices through natural language instructions. The framework combines computer vision for UI element detection with LLM integration for intelligent decision-making, supporting models like Anthropic Claude and Google Gemini. It replaces brittle selectors with vision-based automation, enabling AI agents to execute complex multi-step workflows across platforms.