wandb/weave
A Weights & Biases toolkit for tracing, evaluating, and debugging language model applications.

Velocity · 7d
+1.0
★ / day
Trend
→steady
star history
Weave is a development toolkit for generative AI applications that provides observability and debugging capabilities for language model inputs, outputs, and execution traces. It enables developers to build rigorous evaluations for LLM use cases and organize information across the full AI development lifecycle from experimentation through production. The toolkit supports tracing for major LLM providers including OpenAI, Anthropic, Google AI Studio, and Hugging Face.