ai-dynamo/dynamo
A datacenter-scale distributed inference serving framework developed by NVIDIA for deploying and serving AI models across large-scale infrastructure.

Velocity · 7d
+16
★ / day
Trend
→steady
star history
Dynamo is a distributed inference serving framework designed to handle AI model inference at datacenter scale. Built in Rust for high performance, it provides infrastructure for deploying and managing model inference across distributed compute clusters. The project includes recipes, examples, prebuilt NGC containers, and comprehensive documentation to facilitate enterprise-grade AI inference deployments.