Is Grounded-Segment-Anything open source?

Yes — IDEA-Research/Grounded-Segment-Anything is open source, released under the Apache-2.0 license.

What language is Grounded-Segment-Anything written in?

IDEA-Research/Grounded-Segment-Anything is primarily written in Jupyter Notebook.

How popular is Grounded-Segment-Anything?

IDEA-Research/Grounded-Segment-Anything has 17.7k stars on GitHub.

Where can I find Grounded-Segment-Anything?

IDEA-Research/Grounded-Segment-Anything is on GitHub at https://github.com/IDEA-Research/Grounded-Segment-Anything.

← all repositories

IDEA-Research/Grounded-Segment-Anything

CV plumbing that actually works: wiring open-world models together

A wiring harness for Grounding DINO, SAM, Stable Diffusion, and friends—so you can detect, segment, and generate from a text prompt without training your own model.

★17.7k stars Jupyter Notebook Computer Vision Image · Video · Audio

View on GitHub ↗ Homepage ↗

Not currently ranked — collecting fresh signals.

star history

What it does Grounded-Segment-Anything bolts together existing computer-vision models into a single pipeline. You type “red backpack” and Grounding DINO finds it, SAM segments it, and Stable Diffusion can inpaint or replace it. The repo is mostly Jupyter notebooks and demo scripts showing how to chain these models, plus a growing playground of variations (audio prompts via Whisper, 3D mesh recovery, automatic labeling with RAM/Tag2Text, etc.).

The interesting bit The authors treat this as explicit “model assembly” rather than a monolithic system. They document how to swap in alternatives—GLIP instead of Grounding DINO, ControlNet instead of Stable Diffusion, FastSAM or MobileSAM if you need speed—and the README tracks community forks and extensions like a shared lab notebook. It’s glue code, but it’s organized glue code.

Key highlights

Zero-shot detection + segmentation from text prompts; no custom training required
Modular design: swap detectors, segmenters, or generators in and out
Extensive demo coverage: inpainting, audio-driven segmentation, 3D body mesh recovery, fashion/human-face editing, automatic dataset labeling
Achieved 46.0 mean AP on CVPR 2023 Segmentation in the Wild zero-shot track (Grounding-DINO-L + SAM-ViT-H)
Docker and local install options; Colab notebooks and HuggingFace Spaces available
Successor Grounded SAM 2 (using SAM 2) already released for video/tracking scenarios

Caveats

The repo is demo-heavy; production integration will require significant work
Performance claims are for specific model combinations; your mileage will vary with substitutions
README notes that some combinations (e.g., Grounding-DINO-B + SAM-HQ) outperform the default stack

Verdict Worth a look if you need to prototype open-vocabulary segmentation or automated labeling without training infrastructure. Skip it if you want a polished, single-model API; this is a construction kit, not a product.

Frequently asked

What is IDEA-Research/Grounded-Segment-Anything?: A wiring harness for Grounding DINO, SAM, Stable Diffusion, and friends—so you can detect, segment, and generate from a text prompt without training your own model.
Is Grounded-Segment-Anything open source?: Yes — IDEA-Research/Grounded-Segment-Anything is open source, released under the Apache-2.0 license.
What language is Grounded-Segment-Anything written in?: IDEA-Research/Grounded-Segment-Anything is primarily written in Jupyter Notebook.
How popular is Grounded-Segment-Anything?: IDEA-Research/Grounded-Segment-Anything has 17.7k stars on GitHub.
Where can I find Grounded-Segment-Anything?: IDEA-Research/Grounded-Segment-Anything is on GitHub at https://github.com/IDEA-Research/Grounded-Segment-Anything.