← all repositories

nunchaku-ai/nunchaku

SVDQuant-based inference engine enabling efficient 4-bit diffusion models for image generation.

nunchaku
Velocity · 7d
+6.7
★ / day
Trend
steady
star history

Nunchaku is a high-performance inference engine for 4-bit quantized neural networks as introduced in the SVDQuant paper (ICLR 2025 Spotlight). It absorbs outliers using low-rank components to enable aggressive quantization of diffusion models including Flux. The project provides ComfyUI integration and quantized model weights for practical deployment of highly compressed generative models.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.