
verazuo/jailbreak_llms

A research dataset of 15,140 prompts, including 1,405 jailbreak prompts, collected from Reddit, Discord, websites, and open-source datasets for LLM security analysis.

3.7k stars · Jupyter Notebook · Data Tooling · LLMOps · Eval
Velocity (7d): +3.5 ★ / day · Trend: steady

This repository accompanies a CCS 2024 paper that characterizes in-the-wild jailbreak prompts targeting LLMs. The dataset contains prompts collected from December 2022 to December 2023 across four platforms: Reddit, Discord, websites, and open-source datasets. Researchers can use it to study LLM safety, evaluate jailbreak susceptibility, and develop countermeasures. The repository includes Jupyter notebooks for data analysis; the full dataset is available on Hugging Face.
