onnx/onnx-tensorrt

The parser that marries ONNX to NVIDIA GPUs

Translates standard ONNX models into TensorRT execution plans so you can run open-format deep-learning graphs on NVIDIA silicon without manual layer translation.

★ 3.2k stars · C++ · Inference · Serving · ML Frameworks

What it does

This is official glue code: a C++ parser library that ingests ONNX models and hands them off to TensorRT for optimized GPU inference. It ships as libnvonnxparser.so with Python bindings and backend wrappers, effectively letting you treat TensorRT as an execution engine for standard ONNX files. If you have an NVIDIA stack and an .onnx file, this is the supported path to avoid rebuilding your graph by hand.
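As a rough illustration of that path, the sketch below loads an ONNX file and runs it through the project's Python backend. It is modeled on the usage pattern documented in the repository README (`onnx_tensorrt.backend.prepare` followed by `run`). The function name `run_with_tensorrt`, the default device string, and any model path passed in are placeholders, and the sketch assumes `onnx`, `onnx_tensorrt`, TensorRT, and a CUDA device are all available.

```python
def run_with_tensorrt(model_path, input_array, device="CUDA:0"):
    """Run one inference pass on an ONNX file via the onnx-tensorrt backend.

    Imports are deferred so that defining this function does not require
    the NVIDIA stack; calling it does.
    """
    import onnx
    import onnx_tensorrt.backend as backend

    model = onnx.load(model_path)                   # read the .onnx file
    engine = backend.prepare(model, device=device)  # hand the graph to TensorRT
    return engine.run(input_array)[0]               # first output tensor
```

Called with a float32 NumPy array whose shape matches the model input, this should return the first output tensor. The TensorRT engine is prepared on every call here, so a real application would likely keep the prepared engine around and reuse it.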

The interesting bit

The README gets oddly specific about InstanceNormalization, offering two implementations (native TensorRT and a plugin-based variant) because which one is faster depends on the layer's parameters. That level of operator-level micromanagement, plus separate official validation tools (trtexec for C++ users and polygraphy for Python users), suggests this is production infrastructure rather than a weekend hack.
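The two validation paths mentioned above are command-line tools, so a quick check can be scripted. The sketch below only builds the argument lists and does not execute anything. `trtexec --onnx=...` and `polygraphy run ... --trt` are the basic invocations, and `--minShapes`/`--optShapes`/`--maxShapes` are trtexec's dynamic-shape options. The helper names and the tensor name `input` are assumptions made for illustration.

```python
import shlex


def trtexec_command(model_path, shapes=None):
    """Build a trtexec argument list for an ONNX model.

    `shapes` optionally maps a profile stage ("min", "opt", "max") to a
    shape string such as "input:1x3x224x224" for dynamic-shape models.
    """
    cmd = ["trtexec", f"--onnx={model_path}"]
    for stage in ("min", "opt", "max"):
        if shapes and stage in shapes:
            cmd.append(f"--{stage}Shapes={shapes[stage]}")
    return cmd


def polygraphy_command(model_path):
    """Build a polygraphy argument list that runs the model with TensorRT."""
    return ["polygraphy", "run", model_path, "--trt"]


if __name__ == "__main__":
    # Print shell-ready commands; "model.onnx" is a placeholder path.
    print(shlex.join(trtexec_command("model.onnx")))
    print(shlex.join(polygraphy_command("model.onnx")))
```

Either list can be passed to `subprocess.run` on a machine where the tools are installed; a non-zero exit status is a reasonable first signal that the model did not parse or build.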

Key highlights

C++ core (libnvonnxparser.so) with a Python backend wrapper
Supports full-dimensions and dynamic input shapes
Operator coverage is selective; a dedicated support matrix lists which ONNX operators the parser can translate for TensorRT
Two official validation paths: trtexec for C++ users and polygraphy for Python users
Targets the latest TensorRT release
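Because operator coverage is selective, it can be worth checking a graph's operators before attempting a parse. The following is a minimal sketch of such a pre-flight check. `ASSUMED_SUPPORTED_OPS` is a tiny illustrative set, not the project's real support matrix, and `MyCustomOp` is a made-up operator name.

```python
# Illustrative subset only: NOT the project's actual operator support matrix.
ASSUMED_SUPPORTED_OPS = frozenset({"Conv", "Relu", "Gemm", "MaxPool", "Softmax"})


def find_unsupported_ops(op_types, supported=ASSUMED_SUPPORTED_OPS):
    """Return the sorted, de-duplicated op types that are not in `supported`."""
    return sorted(set(op_types) - set(supported))


if __name__ == "__main__":
    # Op types as they might appear in a graph; "MyCustomOp" is hypothetical.
    graph_ops = ["Conv", "Relu", "MyCustomOp", "Conv", "Softmax"]
    print(find_unsupported_ops(graph_ops))  # prints ['MyCustomOp']
```

With the `onnx` package installed, the op types could be collected from a loaded model as `[node.op_type for node in model.graph.node]`, and the supported set would need to be filled in from the project's operator documentation.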

Caveats

Operator support is partial; unsupported ONNX layers will fail at parse time
The plugin-based InstanceNormalization implementation cannot be used when building version-compatible or hardware-compatible engines
Requires the full TensorRT and CUDA toolchains; this is not a standalone runtime
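The InstanceNormalization caveat can be encoded as a small guard in build scripts. The sketch below is plain Python with hypothetical names (`choose_instancenorm` and its parameters). It captures only the rule stated above and does not call TensorRT or show how the choice is passed to the parser.

```python
def choose_instancenorm(prefer_plugin, version_compatible=False,
                        hardware_compatible=False):
    """Pick "native" or "plugin" for InstanceNormalization.

    Raises ValueError when the plugin variant is requested for an engine
    that must be version-compatible or hardware-compatible.
    """
    restricted = version_compatible or hardware_compatible
    if prefer_plugin and restricted:
        raise ValueError(
            "plugin InstanceNormalization cannot be used for "
            "version-compatible or hardware-compatible engines"
        )
    return "plugin" if prefer_plugin else "native"
```

Raising an error rather than silently falling back to the native implementation is a design choice here: it keeps a benchmark from reporting numbers for a different implementation than the one requested.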

Verdict

Useful if you are already locked into NVIDIA’s inference stack and want to feed standard ONNX models into TensorRT without hand-rolling layers. If you are not on CUDA, or need guaranteed compliance across the entire ONNX operator spec, this parser will not help.

Frequently asked

What is onnx/onnx-tensorrt?: Translates standard ONNX models into TensorRT execution plans so you can run open-format deep-learning graphs on NVIDIA silicon without manual layer translation.
Is onnx-tensorrt open source?: Yes — onnx/onnx-tensorrt is open source, released under the Apache-2.0 license.
What language is onnx-tensorrt written in?: onnx/onnx-tensorrt is primarily written in C++.
How popular is onnx-tensorrt?: onnx/onnx-tensorrt has 3.2k stars on GitHub.
Where can I find onnx-tensorrt?: onnx/onnx-tensorrt is on GitHub at https://github.com/onnx/onnx-tensorrt.