luminal-ai/luminal
A Rust-based inference compiler that optimizes and runs ML models on GPU hardware.

Velocity (7d): +2.7 ★/day · Trend: steady
Luminal is a general-purpose inference compiler for ML models, written in Rust. It integrates directly with PyTorch as a compiler backend via torch.compile(), so users can compile PyTorch models for optimized inference on CUDA hardware. The project aims for performance close to the theoretical hardware peak on GPUs such as the H100, and it supports running large language models such as Llama 3 8B locally on consumer and data-center GPUs.
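
For readers unfamiliar with the torch.compile() backend hook, here is a minimal sketch of the general PyTorch mechanism. It assumes PyTorch 2.x is installed. The name passthrough_backend is a hypothetical stand-in and is not Luminal's API; a real compiler backend would lower the captured graph to optimized GPU kernels, whereas this one runs it unchanged.

```python
import torch

# Graphs handed to the backend are collected here so we can inspect them.
seen_graphs = []


def passthrough_backend(gm: torch.fx.GraphModule, example_inputs):
    """Hypothetical stand-in backend (NOT Luminal's API).

    torch.compile() calls this with the captured FX graph and example
    inputs, and expects a callable back. A real compiler would lower
    gm.graph to optimized kernels; this sketch returns it unchanged.
    """
    seen_graphs.append(gm)
    return gm.forward


model = torch.nn.Sequential(
    torch.nn.Linear(4, 8),
    torch.nn.ReLU(),
    torch.nn.Linear(8, 2),
).eval()

# Compilation is lazy: the backend runs on the first call, not here.
compiled = torch.compile(model, backend=passthrough_backend)

x = torch.randn(3, 4)
with torch.no_grad():
    expected = model(x)   # eager reference result
    out = compiled(x)     # result via the custom backend
```

Because the stand-in backend returns the graph unmodified, out should match the eager result; an actual backend would swap in its own compiled callable at the same point.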