vignshwarar/AI-Employe
Browser automation agent that uses GPT-4 Vision to autonomously navigate and interact with web pages.

Velocity (7d): +0.6 ★/day · Trend: steady
This project is a browser extension that acts as an autonomous agent, using GPT-4 Vision to visually interpret and control web pages. To make element selection more reliable, it indexes the entire DOM in MeiliSearch so the model can issue commands that reference an element's text rather than screen coordinates. Users teach the agent a task the way they would train a human, and it then carries the task out on its own in the browser.
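
To illustrate the idea of targeting elements by text instead of coordinates, here is a minimal Python sketch. It is not taken from the project's code: a small in-memory word-overlap index stands in for MeiliSearch, and every name in it (`IndexedElement`, `ElementIndex`, `resolve_command`, the `target_text` field, the `el-N` IDs) is hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass
class IndexedElement:
    element_id: str  # hypothetical ID assigned when the DOM is indexed
    tag: str
    text: str


def tokenize(s: str) -> set[str]:
    """Lowercase and split on whitespace."""
    return set(s.lower().split())


class ElementIndex:
    """In-memory stand-in for a search index over DOM elements (assumption)."""

    def __init__(self, elements):
        self.elements = list(elements)

    def search(self, query: str):
        """Return the element sharing the most words with the query, or None."""
        q = tokenize(query)
        best, best_score = None, 0
        for el in self.elements:
            score = len(q & tokenize(el.text))
            if score > best_score:
                best, best_score = el, score
        return best


def resolve_command(command: dict, index: ElementIndex) -> dict:
    """Map a command that names element text to one that targets an element ID."""
    target = index.search(command["target_text"])
    if target is None:
        raise LookupError(f"no element matches {command['target_text']!r}")
    return {"action": command["action"], "element_id": target.element_id}


index = ElementIndex([
    IndexedElement("el-1", "a", "Sign in"),
    IndexedElement("el-2", "button", "Add to cart"),
    IndexedElement("el-3", "button", "Proceed to checkout"),
])

# A command of the kind the model might emit: it names text, not x/y coordinates.
command = {"action": "click", "target_text": "add to cart"}
resolved = resolve_command(command, index)
print(resolved)  # {'action': 'click', 'element_id': 'el-2'}
```

Because the command carries text rather than pixel positions, it should stay valid when the page layout shifts, as long as the element's label is unchanged. A real search engine would presumably add typo tolerance and ranking that this toy index lacks.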