Yes — NVIDIA-AI-Blueprints/rag is open source, released under the Apache-2.0 license.

What language is rag written in?

NVIDIA-AI-Blueprints/rag is primarily written in Python.

NVIDIA-AI-Blueprints/rag has 713 stars on GitHub.

Where can I find rag?

NVIDIA-AI-Blueprints/rag is on GitHub at https://github.com/NVIDIA-AI-Blueprints/rag.

NVIDIA-AI-Blueprints/rag

NVIDIA’s RAG Blueprint wires NIMs, LangGraph, and guardrails together

Reference architecture for enterprises that want GPU-accelerated retrieval, multimodal ingestion, and optional agentic query planning behind a single OpenAI-compatible API.

★713 stars Python RAG · Search Agents

View on GitHub ↗ Homepage ↗

Not currently ranked — collecting fresh signals.

star history

What it does

This is NVIDIA’s reference integration layer for building enterprise RAG pipelines atop its own NIM microservices. It ingests multimodal documents—text, tables, charts, infographics, even audio—through NeMo Retriever, stores embeddings in a GPU-accelerated vector database, and grounds LLM answers in retrieved context with optional guardrails and reranking. A LangChain-based orchestrator exposes an OpenAI-compatible API and a sample UI for multi-turn conversations.

The interesting bit

Where a standard RAG pipeline simply retrieves then generates, the optional Agentic RAG mode uses a LangGraph plan-and-execute agent to break complex, multi-hop questions into parallel sub-tasks, run scope discovery, and synthesize answers with streaming reasoning traces. It is essentially a pre-wired research assistant mode you can toggle per request.

Key highlights

Agentic RAG with LangGraph for ambiguous or cross-document queries, including parallel sub-task execution and optional verification.
Multimodal ingestion pipeline handling text, images, tables, charts, and audio via NeMo Retriever extraction.
GPU-accelerated vector search through NVIDIA cuVS, with Elasticsearch or Milvus as pluggable backends.
Optional programmable guardrails for content safety and topic control via NemoGuard NIMs.
Pre-built agent skills for coding assistants (Claude Code, Cursor, Codex) to deploy and benchmark the stack from natural language prompts.
Validated OpenShift deployment with declarative Helm chart support.

Caveats

The architecture is tightly coupled to the NVIDIA NIM ecosystem; expect to operate multiple NVIDIA-specific microservices.
Agentic RAG limitations and minimum hardware requirements live in secondary documentation, not the main README.

Verdict

Best for teams already running NVIDIA infrastructure who need a configurable, enterprise-grade retrieval stack with guardrails and an agentic option. If you are looking for a lightweight, framework-agnostic RAG library, this is overkill.

Frequently asked

What is NVIDIA-AI-Blueprints/rag?: Reference architecture for enterprises that want GPU-accelerated retrieval, multimodal ingestion, and optional agentic query planning behind a single OpenAI-compatible API.
Is rag open source?: Yes — NVIDIA-AI-Blueprints/rag is open source, released under the Apache-2.0 license.
What language is rag written in?: NVIDIA-AI-Blueprints/rag is primarily written in Python.
How popular is rag?: NVIDIA-AI-Blueprints/rag has 713 stars on GitHub.
Where can I find rag?: NVIDIA-AI-Blueprints/rag is on GitHub at https://github.com/NVIDIA-AI-Blueprints/rag.