← all repositories

Kaden-Schutt/hipfire

A native LLM inference engine for AMD RDNA GPUs built in Rust, shipping as a single binary with HIP kernels.

hipfire
Velocity · 7d
+5.4
★ / day
Trend
steady
star history

Hipfire is an RDNA-native inference engine targeting the entire AMD RDNA family from RDNA1 through RDNA4 on consumer, pro, and APU hardware. It runs LLM inference without Python or the full ROCm userspace stack, instead shipping pre-compiled kernels and JIT-compiling the rest through HIP. Supports quantization, speculative decoding via DFlash, and provides an Ollama-style CLI with an OpenAI-compatible API daemon.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.