
NVIDIA/TensorRT

NVIDIA SDK for optimizing and executing deep learning model inference on NVIDIA GPUs.

13k stars C++ Inference · Serving
Velocity · 7d: +5.0 ★ / day
Trend: steady

TensorRT provides tools and libraries for optimizing and deploying neural network models for high-performance inference. It converts trained models into optimized inference engines, applies optimizations such as quantization and layer fusion, and executes the engines efficiently on NVIDIA GPU hardware. This open-source repository includes plugins, ONNX parser components, and sample applications that demonstrate these inference capabilities.
