qualifire-dev/rogue
A security-focused evaluation platform for stress-testing and red-teaming AI agents against policies and vulnerabilities.

Velocity · 7d
+2.8
★ / day
Trend
→steady
star history
Rogue provides two evaluation modes for AI agents: automatic evaluation that verifies compliance with business policies and expected behaviors, and red teaming that simulates adversarial attacks using 75+ vulnerability categories and 8 compliance frameworks. The platform features a client-server architecture with a terminal interface and CLI for CI/CD integration, offering detailed pass/fail reports and CVSS-based risk scoring.