MLGroupJLU/LLM-eval-survey
A survey paper repository collecting academic papers and resources on evaluating large language models across various benchmarks and assessment dimensions.

This repository serves as the official GitHub page for the survey paper ‘A Survey on Evaluation of Large Language Models’. It organizes papers and resources related to LLM evaluation, including benchmarks, assessment methodologies, and evaluation frameworks. The repository provides a structured collection of evaluation research with references to related projects like PromptBench and LLM-eval for robustness and general evaluation of LLMs.