alibaba/MNN
MNN is a high-performance, lightweight inference engine from Alibaba that runs LLMs and deep learning models on mobile and edge devices.

Velocity · 7d: +5.9 ★ / day · Trend: → steady
MNN is an inference runtime designed to execute deep learning models efficiently on resource-constrained devices. It supports quantization and hardware acceleration via Vulkan, and it runs transformer-based LLMs, including the Qwen series, entirely on-device. The engine powers applications such as local LLM chatbots, 3D avatar interaction, and AI image editing.