← all repositories

VoltaML/voltaML

A Python library for optimizing and compiling deep learning models for high-performance inference on CPU and GPU.

1.2k stars Python Inference · Serving
voltaML
Velocity · 7d
+0.8
★ / day
Trend
steady
star history

VoltaML is a lightweight library that accelerates machine learning and deep learning model inference through optimization and compilation. It supports multiple inference runtimes including TensorRT, TorchScript, ONNX, and TVM, and provides out-of-the-box support for FP16 and Int8 quantization. The library offers hardware-specific compilation targeting CPU and GPU devices with deployment achievable through a single line of code.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.