← all repositories

ethz-spylab/agentdojo

A benchmark suite for evaluating prompt injection attacks and defenses on large language model agents.

602 stars Python AgentsLLMOps · Eval
agentdojo
Velocity · 7d
+0.7
★ / day
Trend
steady
star history

AgentDojo provides a dynamic evaluation environment for testing the security of LLM agents against prompt injection attacks. It includes benchmark suites with predefined tasks and supports evaluating both attack strategies and defense mechanisms. The framework ships with a built-in prompt injection detector using transformer models to serve as a baseline defense.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.