Kaden-Schutt/hipfire
A native LLM inference engine for AMD RDNA GPUs built in Rust, shipping as a single binary with HIP kernels.

Velocity · 7d
+5.4
★ / day
Trend
→steady
star history
Hipfire is an RDNA-native inference engine targeting the entire AMD RDNA family from RDNA1 through RDNA4 on consumer, pro, and APU hardware. It runs LLM inference without Python or the full ROCm userspace stack, instead shipping pre-compiled kernels and JIT-compiling the rest through HIP. Supports quantization, speculative decoding via DFlash, and provides an Ollama-style CLI with an OpenAI-compatible API daemon.