← all repositories

vignshwarar/AI-Employe

Browser automation agent that uses GPT-4 Vision to autonomously navigate and interact with web pages.

584 stars TypeScript AgentsRAG · Search
AI-Employe
Velocity · 7d
+0.6
★ / day
Trend
steady
star history

This project builds a browser extension that acts as an autonomous agent by leveraging GPT-4 Vision to visually understand and control web pages. It solves the reliability problem of element selection by indexing the entire DOM in MeiliSearch, allowing the AI to generate commands referencing specific element text rather than coordinates. Users teach the agent tasks as if training a human, and it executes them autonomously in the browser.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.