cyrusbehr/tensorrt-cpp-api
A modern C++ library that converts ONNX models to optimized NVIDIA TensorRT engines for high-performance GPU inference.

Velocity · 7d
+0.5
★ / day
Trend
→steady
star history
tensorrt_cpp_api provides a clean C++ interface for building and running TensorRT inference engines on NVIDIA GPUs. It handles ONNX model parsing, engine caching, and tensor management with caller-owned CUDA streams. The library exposes a simple name-keyed tensor API with explicit host/device transfers and a Result-based error model, targeting TensorRT 10+ and CUDA 12.