google-ai-edge/litert-torch
Google's tool for converting PyTorch models to LiteRT's .tflite format for on-device ML inference.

Velocity · 7d
+1.4
★ / day
Trend
→steady
star history
LiteRT Torch is a Python library that converts PyTorch models into TFLite flatbuffers for edge deployment. It integrates with torch.export() and provides broad CPU coverage with initial GPU and NPU support for on-device inference. The Generative API specifically supports authoring and quantizing LLMs and transformer-based models to optimize performance on mobile and IoT devices.