UKGovernmentBEIS/inspect_ai
A framework for evaluating large language models created by the UK AI Security Institute.

Velocity (7d): +2.3 ★/day · Trend: steady
Inspect is an open-source evaluation framework for large language models developed by the UK AI Security Institute. It provides built-in components for prompt engineering, tool use, multi-turn dialog, and model-graded evaluations. The framework ships with over 200 pre-built evaluations that can be run against any model, and it can be extended through external Python packages.
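To make the idea of a model-graded evaluation concrete, here is a minimal sketch in plain Python. It does not use Inspect's own API: `Sample`, `model_graded_accuracy`, and the two stub models are hypothetical names invented for illustration, and the grading prompt format is an assumption. The point is only the shape of the loop: a candidate model answers each prompt, and a second model judges the answer against a reference.

```python
from dataclasses import dataclass
from typing import Callable, Sequence

# NOTE: every name here is hypothetical; none of it is the inspect_ai API.
Model = Callable[[str], str]  # a "model" is any function from prompt to text


@dataclass
class Sample:
    prompt: str
    target: str  # reference answer shown to the grader


def model_graded_accuracy(
    samples: Sequence[Sample], candidate: Model, grader: Model
) -> float:
    """Return the fraction of samples the grader marks as correct."""
    correct = 0
    for sample in samples:
        answer = candidate(sample.prompt)
        grading_prompt = (
            f"Question: {sample.prompt}\n"
            f"Reference answer: {sample.target}\n"
            f"Submitted answer: {answer}\n"
            "Reply with C if the submission is correct, otherwise I."
        )
        # The grader is expected to reply with a single letter.
        if grader(grading_prompt).strip().upper() == "C":
            correct += 1
    return correct / len(samples) if samples else 0.0


def stub_candidate(prompt: str) -> str:
    """Stand-in for the model under evaluation (canned answers)."""
    canned = {"What is 2 + 2?": "4", "Capital of France?": "Lyon"}
    return canned.get(prompt, "")


def stub_grader(grading_prompt: str) -> str:
    """Stand-in for a grader model: exact match on the two answer fields."""
    fields = dict(
        line.split(": ", 1)
        for line in grading_prompt.splitlines()
        if ": " in line
    )
    same = fields["Reference answer"] == fields["Submitted answer"]
    return "C" if same else "I"


samples = [
    Sample("What is 2 + 2?", "4"),
    Sample("Capital of France?", "Paris"),
]
score = model_graded_accuracy(samples, stub_candidate, stub_grader)
```

In a real setup both callables would wrap calls to language models. The stubs stand in so the loop runs without network access.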