← all repositories

UKGovernmentBEIS/inspect_ai

A framework for evaluating large language models created by the UK AI Security Institute.

2.2k stars Python LLMOps · Eval
inspect_ai
Velocity · 7d
+2.3
★ / day
Trend
steady
star history

Inspect is an open-source evaluation framework for large language models developed by the UK AI Security Institute. It provides built-in components for prompt engineering, tool usage, multi-turn dialog, and model-graded evaluations. The framework includes over 200 pre-built evaluations ready to run on any model and supports extensibility through external Python packages.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.