← all repositories

ServiceNow/AgentLab

An open-source framework for developing, testing, and benchmarking web agents built on LLMs, designed for scalability and reproducibility.

585 stars Python AgentsLLMOps · Eval
AgentLab
Velocity · 7d
+0.8
★ / day
Trend
steady
star history

AgentLab provides infrastructure to build custom web agents, run experiments across diverse benchmarks, and evaluate performance using BrowserGym. It includes tools for launching experiments, analyzing results, and comparing agent performance via leaderboards. The framework is built around BrowserGym and supports multiple LLM backends for agent development and systematic evaluation.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.