onnx/onnx-tensorrt
ONNX-TensorRT is a TensorRT backend for ONNX: it parses ONNX models and runs them through NVIDIA TensorRT.

Velocity (7d): +1.1 ★/day · Trend: steady
This project implements the ONNX backend interface on top of NVIDIA TensorRT. It provides parsing libraries and executables that compile ONNX models into TensorRT engines and run them with GPU acceleration. It depends on CUDA, and each release targets a specific TensorRT version, so the installed TensorRT must match that version for the project to build and run.
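
As a rough illustration of the backend usage described above, the following minimal sketch loads an ONNX file and runs one inference through the project's Python backend module. It assumes the `onnx` and `onnx_tensorrt` Python packages are installed alongside a matching TensorRT and CUDA setup. The helper name `run_with_tensorrt`, the model path, and the default device string are illustrative choices, not part of the project.

```python
def run_with_tensorrt(model_path, input_data, device="CUDA:0"):
    """Parse an ONNX model and run one inference through the TensorRT backend.

    Hypothetical helper for illustration; only `onnx.load`,
    `backend.prepare`, and `engine.run` come from the libraries.
    """
    # Imported lazily so this module can still be loaded on a machine
    # that has no TensorRT / CUDA installation.
    import onnx
    import onnx_tensorrt.backend as backend

    # Load the serialized ONNX graph from disk.
    model = onnx.load(model_path)

    # Build a TensorRT engine for the model on the chosen GPU.
    engine = backend.prepare(model, device=device)

    # Run inference; the backend returns one array per model output,
    # so take the first one.
    return engine.run(input_data)[0]
```

A call might look like `run_with_tensorrt("model.onnx", x)`, where `x` is a NumPy `float32` array whose shape matches the model's input (for example `(1, 3, 224, 224)` for a typical image classifier). The file name and shape here are placeholders.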