qualcomm/GenieX
A high-performance on-device inference SDK for running frontier LLMs and VLMs on NPU, GPU, and CPU across Android, Windows, and Linux.

NexaSDK enables local execution of multimodal AI models including Qwen3-VL, DeepSeek-OCR, and Gemma-3n on edge devices. It provides comprehensive runtime coverage for GPU, NPU, and CPU hardware across mobile (Android/iOS), desktop (Windows/Linux), and IoT platforms. The SDK offers Python and C++ APIs with day-0 support for newly released models, targeting developers building on-device AI applications with minimal energy consumption.
Frequently asked
- What is qualcomm/GenieX?
- A high-performance on-device inference SDK for running frontier LLMs and VLMs on NPU, GPU, and CPU across Android, Windows, and Linux.
- Is GenieX open source?
- Yes — qualcomm/GenieX is open source, released under the BSD-3-Clause license.
- What language is GenieX written in?
- qualcomm/GenieX is primarily written in Rust.
- How popular is GenieX?
- qualcomm/GenieX has 8.1k stars on GitHub and is currently cooling off.
- Where can I find GenieX?
- qualcomm/GenieX is on GitHub at https://github.com/qualcomm/GenieX.