truera/trulens
An evaluation and observability framework for systematically assessing LLM applications and AI agents.

Velocity · 7d
+1.6
★ / day
Trend
→steady
star history
TruLens provides fine-grained, stack-agnostic instrumentation for evaluating and tracking LLM experiments and AI agents. It offers configurable feedback functions and evaluation frameworks including the RAG Triad and Honest-Harmless-Helpful metrics to assess performance and identify failure modes. The tool enables systematic iteration on prompts, models, retrievers, and knowledge sources throughout the development workflow.