ActiveVisionLab/Awesome-LLM-3D
A curated paper list covering multi-modal LLMs applied to 3D understanding, reasoning, generation, and embodied agent tasks.

This repository collects academic papers on large language models applied to 3D environments. It catalogs research across task categories including 3D scene understanding, spatial reasoning, 3D content generation, and embodied AI agents. The list also includes foundation models such as CLIP and SAM to provide broader context on the field. It is actively maintained, with regular updates tracking advances through 2025.