JShollaj/awesome-llm-interpretability
A curated list aggregating research tools, papers, and communities for interpreting and understanding Large Language Models.

Velocity (7d): +1.7 ★/day · Trend: steady
This repository consolidates links to tools, academic papers, articles, and research groups working on LLM interpretability. It covers visualization tools like LIT and Phoenix, analysis frameworks like Pythia and Comgra, and open research from groups including EleutherAI and OpenAI. The list serves as a reference for researchers and engineers studying how transformer-based language models process and represent information.