pytorch/TensorRT
A PyTorch-to-TensorRT compiler that optimizes deep learning models for faster inference on NVIDIA GPUs.

Velocity · 7d
+1.3
★ / day
Trend
→steady
star history
Torch-TensorRT compiles PyTorch, TorchScript, and FX models to TensorRT engines, enabling up to 5x inference speedup on NVIDIA GPUs. It integrates directly with PyTorch as a backend, allowing users to accelerate models with minimal code changes by leveraging TensorRT’s optimization and execution capabilities.