← all repositories

truera/trulens

An evaluation and observability framework for systematically assessing LLM applications and AI agents.

3.4k stars Python LLMOps · EvalAgents
trulens
Velocity · 7d
+1.6
★ / day
Trend
steady
star history

TruLens provides fine-grained, stack-agnostic instrumentation for evaluating and tracking LLM experiments and AI agents. It offers configurable feedback functions and evaluation frameworks including the RAG Triad and Honest-Harmless-Helpful metrics to assess performance and identify failure modes. The tool enables systematic iteration on prompts, models, retrievers, and knowledge sources throughout the development workflow.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.