NVIDIA/TensorRT
NVIDIA SDK for optimizing and executing deep learning model inference on NVIDIA GPUs.

Velocity (7d): +5.0 ★ / day
Trend: steady
TensorRT provides tools and libraries for optimizing and deploying neural network models for high-performance inference. It converts trained models into optimized inference engines, applies optimizations such as quantization and layer fusion, and executes the resulting engines on NVIDIA GPU hardware. This open-source repository includes plugins, ONNX parser components, and sample applications that demonstrate inference.
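
To make the quantization step mentioned above more concrete, here is a minimal sketch of symmetric per-tensor INT8 quantization in plain Python. It illustrates the general idea only and does not use or reproduce the TensorRT API; the function names `quantize_int8` and `dequantize` and the sample weights are hypothetical, chosen for this example.

```python
def quantize_int8(values):
    """Symmetric per-tensor INT8 quantization (illustrative, not TensorRT code).

    Maps floats to integer codes in [-127, 127] using a single scale
    derived from the largest absolute value in the tensor.
    """
    max_abs = max(abs(v) for v in values)
    # An all-zero tensor has no dynamic range; any positive scale works.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    # Round to the nearest code and clamp to the symmetric INT8 range.
    codes = [max(-127, min(127, round(v / scale))) for v in values]
    return codes, scale


def dequantize(codes, scale):
    """Recover approximate float values from integer codes."""
    return [c * scale for c in codes]


# Hypothetical weights, used only to exercise the functions.
weights = [0.5, -1.27, 0.0, 1.0]
codes, scale = quantize_int8(weights)
restored = dequantize(codes, scale)
```

Each restored value differs from the original by at most half a quantization step (`scale / 2`), which is the precision traded for smaller, faster integer arithmetic. Production toolchains typically choose scales from calibration data or training-time statistics rather than from the raw maximum used here.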