ai-dynamo/nixl
NIXL is a C++ library for accelerating point-to-point communications in AI inference frameworks like NVIDIA Dynamo.

Velocity · 7d
+2.3
★ / day
Trend
→steady
star history
The NVIDIA Inference Xfer Library provides a modular plug-in architecture for abstracting memory (CPU, GPU) and storage (file, block, object store) access in AI inference workloads. It offers a Python API, telemetry and observability features, and benchmarks (NIXLBench, KVBench) for performance evaluation. The library is part of the broader NVIDIA AI/Dynamo ecosystem focused on inference acceleration.