← all repositories

Avarok-Cybersecurity/atlas

A high-performance pure Rust inference engine for running large language models on NVIDIA, AMD, and Intel GPUs.

atlas
Velocity · 7d
+14
★ / day
Trend
steady
star history

Atlas is a pure Rust-based LLM inference engine designed for speed and portability across hardware platforms. It supports speculative decoding and state space models like Mamba alongside transformer architectures. The engine targets data center GPUs (DGX, GB10) and exposes an OpenAI-compatible API, allowing integration with existing LLM tooling while leveraging low-level hardware acceleration via CUDA, ROCm, and oneAPI.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.