Is Eagle open source?

Yes — NVlabs/Eagle is open source, released under the Apache-2.0 license.

What language is Eagle written in?

NVlabs/Eagle is primarily written in Python.

How popular is Eagle?

NVlabs/Eagle has 3.2k stars on GitHub and is currently cooling off.

Where can I find Eagle?

NVlabs/Eagle is on GitHub at https://github.com/NVlabs/Eagle.

← all repositories

NVlabs/Eagle

NVIDIA's vision-language stack: research lab that ships

Eagle is less a single model than NVIDIA's internal R&D pipeline for multimodal AI, now open-sourced with three generations of VLMs and a grounding specialist.

★3.2k stars Python Language Models Computer Vision

View on GitHub ↗ Homepage ↗

Velocity · 7d

+7.1

★ / day

Trend

↘cooling

star history

What it does

Eagle is a family of vision-language models (VLMs) developed by NVIDIA, spanning general image understanding, long-context video reasoning, and embodied grounding. The repository hosts three generations—Eagle, Eagle 2, and Eagle 2.5—plus LocateAnything, a specialist model for detection and pointing tasks. Rather than chasing scale alone, the project emphasizes data-centric strategies: how to curate, mix, and post-train data to push frontier performance without proportionally scaling compute.

The interesting bit

Eagle functions as NVIDIA’s upstream research platform for production systems. The same models or their direct descendants power Llama-Nemotron Nano VLM, NeMo Retriever, the Isaac GR00T humanoid-robot stack (N1 through N1.6), and Cosmos. That is unusual: most academic VLMs stay in the paper; Eagle’s architecture choices and training recipes are battle-tested inside shipping products before they hit Hugging Face.

Key highlights

Three generations, one repo: Eagle (mixture-of-encoders design), Eagle 2 (post-training data strategies for image understanding), Eagle 2.5 (long-context image and video reasoning).
LocateAnything: a grounding specialist using Parallel Box Decoding—predicts each bounding box atomically in a single forward pass, which the project claims yields substantially faster throughput than quantized coordinate decoding.
Downstream adoption: serves as VLM backbone for GR00T-N1, N1.5, and N1.6 (NVIDIA’s humanoid-robot foundation models), plus Nemotron and Cosmos pipelines.
Publication record: ICLR 2025 Spotlight for Eagle; NeurIPS 2025 for Eagle 2.5.
Model licenses are split: code under Apache 2.0, model weights under a separate NVIDIA License (check Eagle2_5/LICENSE_MODEL before commercial use).

Caveats

The README is heavy on NVIDIA product cross-references and light on training details; reproducing the data recipes will require reading the full technical reports.
“SOTA” claims are stated but not quantified in the README—benchmark tables live in the linked PDFs, not the repository.

Verdict

Worth studying if you build VLMs or need to understand NVIDIA’s multimodal stack. Skip if you want a drop-in, batteries-included API; this is research infrastructure with product lineage, not a polished consumer tool.

Frequently asked

What is NVlabs/Eagle?: Eagle is less a single model than NVIDIA's internal R&D pipeline for multimodal AI, now open-sourced with three generations of VLMs and a grounding specialist.
Is Eagle open source?: Yes — NVlabs/Eagle is open source, released under the Apache-2.0 license.
What language is Eagle written in?: NVlabs/Eagle is primarily written in Python.
How popular is Eagle?: NVlabs/Eagle has 3.2k stars on GitHub and is currently cooling off.
Where can I find Eagle?: NVlabs/Eagle is on GitHub at https://github.com/NVlabs/Eagle.