← all repositories

wandb/weave

A Weights & Biases toolkit for tracing, evaluating, and debugging language model applications.

1.1k stars Python LLMOps · EvalAgents
weave
Velocity · 7d
+1.0
★ / day
Trend
steady
star history

Weave is a development toolkit for generative AI applications that provides observability and debugging capabilities for language model inputs, outputs, and execution traces. It enables developers to build rigorous evaluations for LLM use cases and organize information across the full AI development lifecycle from experimentation through production. The toolkit supports tracing for major LLM providers including OpenAI, Anthropic, Google AI Studio, and Hugging Face.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.