jonyzhang2023/awesome-embodied-vla-va-vln
A curated collection of 700+ research papers on embodied AI models including vision-language-action (VLA), world-action models (WAM), and vision-language navigation (VLN) systems.

This repository aggregates state-of-the-art research in embodied AI, specifically covering vision-language-action models that enable robots to reason and act from visual and language inputs. It catalogs work on vision-language navigation tasks, diffusion-based action policies, and MLLM-driven robotic planning systems. The collection includes survey papers, model architectures, and benchmarking resources across the embodied AI research space.