paradigmxyz/evmbench
A benchmark harness for running AI agents to find and exploit vulnerabilities in Ethereum smart contracts.

Velocity · 7d
+3.9
★ / day
Trend
→steady
star history
evmbench is a web-based evaluation framework that provides a UI for uploading Solidity smart contracts and running AI agents against them to generate structured vulnerability reports. It includes a frontend (Next.js), backend API (FastAPI), job queue (RabbitMQ), and containerized worker infrastructure (Docker/K8s). The system wraps OpenAI’s frontier-evals evaluation code as a pinned submodule for benchmarking agent performance on security tasks.