← all repositories

qualifire-dev/rogue

A security-focused evaluation platform for stress-testing and red-teaming AI agents against policies and vulnerabilities.

1k stars Python LLMOps · EvalAgents
rogue
Velocity · 7d
+2.8
★ / day
Trend
steady
star history

Rogue provides two evaluation modes for AI agents: automatic evaluation that verifies compliance with business policies and expected behaviors, and red teaming that simulates adversarial attacks using 75+ vulnerability categories and 8 compliance frameworks. The platform features a client-server architecture with a terminal interface and CLI for CI/CD integration, offering detailed pass/fail reports and CVSS-based risk scoring.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.