Avarok-Cybersecurity/atlas
A high-performance pure Rust inference engine for running large language models on NVIDIA, AMD, and Intel GPUs.

Velocity · 7d
+14
★ / day
Trend
→steady
star history
Atlas is a pure Rust-based LLM inference engine designed for speed and portability across hardware platforms. It supports speculative decoding and state space models like Mamba alongside transformer architectures. The engine targets data center GPUs (DGX, GB10) and exposes an OpenAI-compatible API, allowing integration with existing LLM tooling while leveraging low-level hardware acceleration via CUDA, ROCm, and oneAPI.