← all repositories

luminal-ai/luminal

A Rust-based inference compiler that optimizes and runs ML models on GPU hardware.

luminal
Velocity · 7d
+2.7
★ / day
Trend
steady
star history

Luminal is a general-purpose inference compiler for ML models written in Rust. It directly integrates with PyTorch as a compiler backend via torch.compile(), allowing users to compile PyTorch models for optimized inference on CUDA hardware. The project targets near-theoretical maximum performance on hardware like H100 GPUs and supports running large language models like Llama 3 8B locally on consumer and data-center GPUs.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.