HowieHwong/TrustLLM

A comprehensive benchmark and evaluation toolkit for assessing trustworthiness dimensions in large language models.

Velocity (7d): +0.7 ★/day · Trend: steady

TrustLLM is a benchmark framework for evaluating the trustworthiness of LLMs along dimensions such as fairness, safety, robustness, and transparency. It provides a standardized evaluation toolkit with an associated dataset, metrics, and leaderboard, so researchers can systematically assess and compare how well different language models adhere to trustworthiness principles.
