Everlyn-Labs/Everlyn-1
An open autoregressive model for generating images and videos with multimodal understanding capabilities.

Velocity · 7d
+4.8
★ / day
Trend
→steady
star history
Everlyn-1 presents research on foundational video AI models with three main components: a distribution matching approach for video tokenization using Wasserstein distance, EfficientARV for efficient autoregressive image and video generation, and ANTRP for reducing hallucinations in multimodal large language models. The project explores image animation, inpainting, video prediction, and integration with MLLMs.