a-real-ai/pywinassistant
An open-source agentic framework that operates Windows graphical user interfaces autonomously from natural language instructions, emulating human-computer interaction through OS accessibility APIs.

PyWinAssistant is an AI agent framework that navigates and controls Windows applications by interpreting natural language commands. It combines symbolic reasoning with visualization techniques to understand interface structure and execute multi-step tasks. Instead of relying on traditional computer vision, the system extracts UI metadata directly through Windows Accessibility APIs, which gives it a precise spatial understanding of interface elements.
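
To make the accessibility-metadata idea concrete, here is a minimal Python sketch of how an agent might search a UI tree by element name and derive a click target from the element's bounding rectangle. It is an illustration only, not PyWinAssistant's actual code: `UIElement`, `walk`, `find_element`, and `click_point` are hypothetical names, and the tree is hard-coded so the example runs anywhere. In practice the names, control types, and rectangles would come from a Windows accessibility library.

```python
from dataclasses import dataclass, field
from typing import Iterator, List, Optional, Tuple


@dataclass
class UIElement:
    """Hypothetical snapshot of one accessibility node (not PyWinAssistant's own type)."""
    name: str
    control_type: str
    rect: Tuple[int, int, int, int]  # (left, top, right, bottom) in screen pixels
    children: List["UIElement"] = field(default_factory=list)


def walk(node: UIElement) -> Iterator[UIElement]:
    """Yield the node and all of its descendants, depth-first."""
    yield node
    for child in node.children:
        yield from walk(child)


def find_element(
    root: UIElement, name: str, control_type: Optional[str] = None
) -> Optional[UIElement]:
    """Return the first element whose name matches (case-insensitive), or None."""
    wanted = name.lower()
    for node in walk(root):
        type_ok = control_type is None or node.control_type == control_type
        if node.name.lower() == wanted and type_ok:
            return node
    return None


def click_point(element: UIElement) -> Tuple[int, int]:
    """Centre of the bounding rectangle, a natural place to click."""
    left, top, right, bottom = element.rect
    return ((left + right) // 2, (top + bottom) // 2)


# Hard-coded stand-in for metadata an accessibility API would report.
window = UIElement("Untitled - Notepad", "Window", (0, 0, 800, 600), [
    UIElement("Menu bar", "MenuBar", (0, 30, 800, 50), [
        UIElement("File", "MenuItem", (0, 30, 40, 50)),
        UIElement("Edit", "MenuItem", (40, 30, 80, 50)),
    ]),
    UIElement("Text editor", "Document", (0, 50, 800, 600)),
])

target = find_element(window, "edit", control_type="MenuItem")
if target is not None:
    print(target.name, click_point(target))  # prints: Edit (60, 40)
```

Because the coordinates are read from metadata rather than inferred from pixels, the click target is exact whenever the reported rectangle is accurate. That is the advantage the description above attributes to working through accessibility APIs rather than screenshots.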