NVIDIA/GenerativeAIExamples
NVIDIA's repository of reference workflows for building RAG pipelines, agentic systems, and fine-tuning pipelines with NVIDIA NeMo microservices and NIM inference microservices.

This repository provides Jupyter notebook-based reference implementations for generative AI development on NVIDIA infrastructure. It covers retrieval-augmented generation (RAG) workflows with knowledge graphs, agentic systems that use tool calling, fine-tuning of embedding and language models through NeMo microservices, and model evaluation and guardrailing. Workflows are optimized for GPU acceleration using TensorRT and Triton Inference Server, and are delivered as microservices for cloud-native deployment.
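For readers new to the term, the retrieve-then-prompt pattern behind a RAG workflow can be sketched in a few lines of plain Python. This is an illustrative toy and not code from the repository: the function names (`embed`, `cosine`, `retrieve`, `build_prompt`), the three-sentence corpus, and the bag-of-words "embedding" are all assumptions made for the example. A real workflow would presumably swap in a learned embedding model and send the assembled prompt to an LLM endpoint.

```python
import math
import re
from collections import Counter

# Hypothetical toy corpus; a real pipeline would index actual documents.
DOCS = [
    "Triton Inference Server serves models over HTTP and gRPC.",
    "TensorRT optimizes neural networks for GPU inference.",
    "Guardrails constrain what a chat model is allowed to say.",
]


def embed(text):
    """Toy stand-in for an embedding model: lowercase word counts."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[token] for token, count in a.items() if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query, documents, k=2):
    """Return the k documents most similar to the query."""
    query_vec = embed(query)
    ranked = sorted(
        documents,
        key=lambda doc: cosine(query_vec, embed(doc)),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(query, passages):
    """Assemble the retrieved passages and the question into one prompt."""
    context = "\n".join(f"- {p}" for p in passages)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {query}"
    )


if __name__ == "__main__":
    question = "What optimizes networks for GPU inference?"
    print(build_prompt(question, retrieve(question, DOCS, k=1)))
```

The generation step is deliberately left out: the string returned by `build_prompt` is what would be passed to a language model, and which client or endpoint does that depends on the deployment.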