NVIDIA-AI-Blueprints/video-search-and-summarization
GPU-accelerated vision agents for video search, summarization, and visual Q&A using VLMs and LLMs.

NVIDIA’s Video Search and Summarization Blueprint provides reference architectures for building vision-language agents that process video streams in real time. The system combines GPU-accelerated vision microservices with vision language models and large language models to enable natural language search, semantic retrieval, clip extraction, and summarization of video content. It supports agentic workflows via Model Context Protocol and integrates with NVIDIA NIM microservices for deployment.