Question 1

What is FoundationVision/Groma?

Accepted Answer

Groma is a multimodal large language model (MLLM) that uses localized visual tokenization to support region-level understanding and visual grounding.

Question 2

Is Groma open source?

Accepted Answer

Yes. FoundationVision/Groma is open source, released under the Apache-2.0 license.

Question 3

What language is Groma written in?

Accepted Answer

FoundationVision/Groma is primarily written in Python.

Question 4

How popular is Groma?

Accepted Answer

FoundationVision/Groma had 585 stars on GitHub at the time of writing.

Question 5

Where can I find Groma?

Accepted Answer

FoundationVision/Groma is on GitHub at https://github.com/FoundationVision/Groma.
