Maknee/minigpt4.cpp

C++ implementation of MiniGPT-4 enabling quantized CPU inference using the GGML library.

★572 stars C++ Inference · Serving Language Models

View on GitHub ↗

Velocity · 7d

+0.5

★ / day

Trend

→steady

star history

This project ports MiniGPT-4, a vision-language model, into pure C/C++ using the GGML tensor library for efficient inference. It supports multiple quantization levels (4-bit through 16-bit) to enable running large models on CPU with reduced memory footprint. The implementation includes a web UI and can load and preprocess images within the library.