verazuo/jailbreak_llms
A research dataset of 15,140 prompts including 1,405 jailbreak prompts collected from Reddit, Discord, websites, and open-source datasets for LLM security analysis.

This repository accompanies a CCS 2024 paper on characterizing in-the-wild jailbreak prompts on LLMs. The dataset contains prompts collected from December 2022 to December 2023 across four platforms. Researchers can use this dataset to study LLM safety, evaluate jailbreak susceptibility, and develop countermeasures. The repository includes Jupyter notebooks for data analysis and the full dataset on Hugging Face.