thu-coai/Safety-Prompts
A dataset of 100k Chinese safety prompts with ChatGPT responses for evaluating and improving LLM safety alignment.

Velocity · 7d
+1.0
★ / day
Trend
→steady
star history
This repository provides a large collection of Chinese safety prompts designed for evaluating and improving the safety of large language models. It includes 100k samples covering various safety scenarios and instruction attacks, paired with ChatGPT responses. The dataset can be used to comprehensively assess model safety and enhance model knowledge about safety topics, helping align model outputs with human values.