zolotukhin/zinc
A native Zig-based LLM inference engine targeting AMD RDNA3/RDNA4 GPUs via Vulkan and Apple Silicon via Metal.

Velocity (7d): +5.3 ★/day · Trend: steady
ZINC runs large language model inference locally on consumer hardware without requiring ROCm on AMD or MLX on Apple Silicon. It uses hand-written compute shaders and a managed model catalog that covers Qwen 3 and Gemma 4 variants. The engine ships as a single binary and offers streaming inference on supported GPUs under Linux and macOS.