ethz-spylab/agentdojo
A benchmark suite for evaluating prompt injection attacks and defenses on large language model agents.

Velocity · 7d
+0.7
★ / day
Trend
→steady
star history
AgentDojo provides a dynamic evaluation environment for testing the security of LLM agents against prompt injection attacks. It includes benchmark suites with predefined tasks and supports evaluating both attack strategies and defense mechanisms. The framework ships with a built-in prompt injection detector using transformer models to serve as a baseline defense.