Vector Search on GPUs, Minus the Architecture Maintenance
cuVS runs vector search and clustering on NVIDIA GPUs so you don't have to chase every new CUDA release.

What it does
cuVS is a RAPIDS library that implements approximate nearest-neighbor search and clustering algorithms—think CAGRA, K-means, and HDBSCAN—on NVIDIA GPUs. It targets the usual embedding-heavy workloads: semantic search, RAG pipelines, recommender systems, and k-NN graph construction. APIs are available in Python, C++, C, and Rust, and the library supports building indexes on GPU for later CPU deployment.
The interesting bit The README explicitly says cuVS “shoulders the burden” of updating accelerated code for new NVIDIA architectures and CUDA versions. That is the tedious, high-friction work most teams ignore until a driver upgrade breaks their build, and it is the library’s most underappreciated selling point.
Key highlights
- Backed by research implementations including graph-based ANN (
CAGRA), Top-K, NN-Descent, and sparse neighborhood methods, each with cited papers. - Interoperability: build an index on GPU, then deploy it on CPU.
- Ships with drop-in CMake templates and self-contained examples in C++, C, Rust, and Python.
- Underpinned by the RAFT library of machine-learning primitives.
Caveats
- Binary size is a known rough edge. The README warns that
CUDA 12builds are roughly double the size ofCUDA 13builds, and the team will not slim down the former until Spring 2026. If footprint matters, the README points to alibcuvs-staticworkaround until then.
Verdict Worth evaluating if you are building a vector database, search layer, or clustering pipeline and want GPU throughput without owning the CUDA maintenance. Skip it if your stack is CPU-only or non-NVIDIA.
Frequently asked
- What is NVIDIA/cuvs?
- cuVS runs vector search and clustering on NVIDIA GPUs so you don't have to chase every new CUDA release.
- Is cuvs open source?
- Yes — NVIDIA/cuvs is open source, released under the Apache-2.0 license.
- What language is cuvs written in?
- NVIDIA/cuvs is primarily written in Cuda.
- How popular is cuvs?
- NVIDIA/cuvs has 790 stars on GitHub and is currently holding steady.
- Where can I find cuvs?
- NVIDIA/cuvs is on GitHub at https://github.com/NVIDIA/cuvs.