meta-llama/PurpleLlama
A suite of tools, benchmarks, and safeguards from Meta for evaluating cybersecurity risks in open generative AI models and improving their safety.

Velocity (7d): +4.6 ★ / day
Trend: steady
Purple Llama is an umbrella project offering tools and evaluations for responsible generative AI development. It includes CyberSec Eval, a benchmark for assessing cybersecurity risks in LLMs, and Llama Guard, an input/output safeguard model. The project takes a purple-team approach, combining offensive (red-team) and defensive (blue-team) security postures to evaluate and mitigate risks in AI systems.