← all repositories

NVlabs/ToolOrchestra

NVIDIA's RL-based training framework for building LLM agents that orchestrate tool use and agentic workflows.

ToolOrchestra
Velocity · 7d
+3.8
★ / day
Trend
steady
star history

ToolOrchestra is a research framework from NVIDIA for training large language models to function as autonomous agents capable of orchestrating multiple tools in coordinated workflows. It provides an end-to-end reinforcement learning training pipeline that enables models to learn when and how to invoke external tools in sequence. The project includes both a training framework, a released 8B parameter model on HuggingFace, and a curated dataset called ToolScale for training such agent systems.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.