
showlab/Paper2Video

Paper2Video converts an academic paper, together with a speaker photo and reference audio, into a narrated presentation video.

Velocity (7d): +9.3 ★/day · Trend: steady

The system takes a scientific paper, a speaker image, and a reference audio clip as input and automatically synthesizes a presentation video. It combines multiple AI components, likely language models for paper comprehension, speech synthesis for narration, and video generation for avatar animation. The project includes a trained model and a supporting dataset for training and evaluation.
