← all repositories

steel-dev/surf.new

An open-source platform for testing and evaluating AI agents that autonomously browse and interact with websites.

512 stars TypeScript AgentsLLMOps · Eval
surf.new
Velocity · 7d
+1.0
★ / day
Trend
steady
star history

surf.new provides a controlled environment to visualize and benchmark how different AI agents (Claude, GPT-4, etc.) navigate and interact with the web. It combines a Next.js frontend with a Python backend, leveraging the Steel SDK for browser automation capabilities. The project allows developers to compare agent performance across various AI providers while observing real-time browser interactions.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.