
ai-dynamo/dynamo

A datacenter-scale distributed inference serving framework developed by NVIDIA for deploying and serving AI models across large-scale infrastructure.

Velocity (7d): +16 ★ / day
Trend: steady

Dynamo is a distributed inference serving framework for running AI model inference at datacenter scale. Built in Rust for high performance, it provides the infrastructure to deploy and manage model inference across distributed compute clusters. The project ships with recipes, examples, prebuilt NGC containers, and documentation to support enterprise deployments.
