← all repositories

ServiceNow/BrowserGym

BrowserGym is a benchmark environment for training, evaluating, and comparing autonomous web agents powered by LLMs.

1.2k stars Python AgentsLLMOps · Eval
BrowserGym
Velocity · 7d
+1.5
★ / day
Trend
steady
star history

BrowserGym provides an open framework for web agent research, implementing environments where AI agents can interact with websites to complete tasks. It offers standardized benchmarks including WebArena and WorkArena for evaluating agent performance. The project integrates with AgentLab for implementing and testing web agents and supports multimodal agents using vision-language models.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.