vectara/hallucination-leaderboard
A public leaderboard that evaluates and ranks LLMs based on hallucination rates when summarizing documents.

The repository provides a standardized evaluation framework for measuring how often different LLMs introduce hallucinations during document summarization. It uses Vectara’s Hallucination Evaluation Model (HHEM) to score models on metrics including hallucination rate, factual consistency rate, and answer rate. Results are regularly updated and presented as an interactive leaderboard on Hugging Face with tabular data showing performance across dozens of LLMs.