nunchaku-ai/nunchaku
SVDQuant-based inference engine enabling efficient 4-bit diffusion models for image generation.

Velocity (7d): +6.7 ★/day · Trend: steady
Nunchaku is a high-performance inference engine for 4-bit quantized neural networks, implementing the method introduced in the SVDQuant paper (ICLR 2025 Spotlight). SVDQuant absorbs outliers into a low-rank branch kept at higher precision, so the remaining residual can be quantized aggressively to 4 bits; this makes it applicable to diffusion models such as Flux. The project provides ComfyUI integration and pre-quantized model weights for practical deployment of highly compressed generative models.
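The outlier-absorption idea can be illustrated with a small NumPy sketch. This is not Nunchaku's API or its actual kernels: the function names (`quantize_4bit`, `low_rank_split`, `compare_errors`), the per-tensor symmetric quantizer, the matrix size, and the synthetic outlier column are all assumptions chosen for illustration, and the smoothing step and fused GPU kernels of the real method are omitted. It only shows why removing a low-rank component before quantizing can shrink the 4-bit error when a few large values dominate the range.

```python
import numpy as np


def quantize_4bit(x):
    """Symmetric per-tensor 4-bit fake quantization (quantize, then dequantize)."""
    max_abs = np.abs(x).max()
    # Map the largest magnitude to the top positive int4 level (7).
    scale = max_abs / 7.0 if max_abs > 0 else 1.0
    q = np.clip(np.round(x / scale), -8, 7)
    return q * scale


def low_rank_split(w, rank):
    """Split w into a rank-`rank` product l1 @ l2 plus a residual, via SVD."""
    u, s, vt = np.linalg.svd(w, full_matrices=False)
    l1 = u[:, :rank] * s[:rank]   # scale the leading left singular vectors
    l2 = vt[:rank, :]
    residual = w - l1 @ l2
    return l1, l2, residual


def compare_errors(seed=0, rank=4):
    """Return (naive_error, low_rank_error) as Frobenius norms."""
    rng = np.random.default_rng(seed)
    # Hypothetical weight matrix: small values plus one outlier column.
    w = 0.1 * rng.standard_normal((64, 64))
    w[:, 3] += 20.0 * rng.standard_normal(64)

    # Naive: quantize everything to 4 bits; the outliers stretch the scale.
    naive = quantize_4bit(w)

    # Low-rank branch stays in full precision; only the residual is quantized.
    l1, l2, residual = low_rank_split(w, rank)
    approx = l1 @ l2 + quantize_4bit(residual)

    return np.linalg.norm(w - naive), np.linalg.norm(w - approx)


if __name__ == "__main__":
    err_naive, err_low_rank = compare_errors()
    print(f"naive 4-bit error:      {err_naive:.3f}")
    print(f"low-rank + 4-bit error: {err_low_rank:.3f}")
```

In this toy setup the outlier column is nearly rank-1, so a small-rank branch captures it and the residual has a much narrower range to quantize. Real weights and activations are less tidy, so the gain there depends on the rank chosen and on how the outliers are distributed.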