ChaofanTao/Autoregressive-Models-in-Vision-Survey
A published survey (TMLR 2025) compiling papers on autoregressive models for vision tasks including image generation, video generation, and multimodal learning.

Velocity · 7d
+1.1
★ / day
Trend
→steady
star history
This repository collects and organizes academic papers on autoregressive models applied to computer vision tasks. It covers areas such as image generation, video generation, diffusion-based approaches, multimodal learning, and applications in embodied AI and medical imaging. The project is associated with a peer-reviewed survey paper published at TMLR 2025.