Maknee/minigpt4.cpp
C++ implementation of MiniGPT-4 enabling quantized CPU inference using the GGML library.

Velocity · 7d
+0.5
★ / day
Trend
→steady
star history
This project ports MiniGPT-4, a vision-language model, into pure C/C++ using the GGML tensor library for efficient inference. It supports multiple quantization levels (4-bit through 16-bit) to enable running large models on CPU with reduced memory footprint. The implementation includes a web UI and can load and preprocess images within the library.