HowieHwong/TrustLLM
A comprehensive benchmark and evaluation toolkit for assessing trustworthiness dimensions in large language models.

Star velocity (7d): +0.7 ★/day · Trend: steady
TrustLLM is a benchmark framework for evaluating the trustworthiness of LLMs across multiple dimensions, such as fairness, safety, robustness, and transparency. It provides a standardized evaluation toolkit with an associated dataset, metrics, and a leaderboard, so researchers can systematically assess and compare how well different language models adhere to trustworthiness principles.