floneum/kalosm
Rust library providing local AI model inference for language, audio, and image models with quantization and GPU acceleration.

Velocity · 7d
+2.0
★ / day
Trend
→steady
star history
Kalosm is an ecosystem of Rust crates enabling local and remote AI model execution. It provides a simple interface for pre-trained models across multiple modalities including text (Llama, Mistral), audio (Whisper transcription), and image models. The Fusor runtime uses WGPU for quantized ML inference across any accelerator. The project supports quantized models and GPU acceleration for efficient local inference.