← all repositories

OpenGVLab/InternVideo

InternVideo provides video foundation models for multimodal video understanding, including video LLMs with temporal reasoning capabilities.

InternVideo
Velocity · 7d
+1.8
★ / day
Trend
steady
star history

This repository contains the InternVideo series of video foundation models trained via generative and discriminative self-supervised learning. InternVideo2 scales these models with multimodal capabilities including video-language alignment and instruction tuning. InternVideo2.5 adds long-context modeling for extended video understanding. The models are distilled into smaller variants and integrated with 7B language models for video chat applications.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.