showlab/Paper2Video
Paper2Video converts academic papers with a speaker photo and reference audio into narrated presentation videos.

Velocity (7d): +9.3 ★ / day · Trend: → steady
The system takes a scientific paper, a speaker image, and a reference audio clip as input and automatically synthesizes a presentation video. It combines several AI components, likely language models for paper comprehension, speech synthesis for the narration audio, and video generation for avatar animation. The project includes a trained model and a supporting dataset for training and evaluation.
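The three-input, multi-stage flow described above can be pictured as a simple staged pipeline. The sketch below is only an illustration under stated assumptions: `PresentationInputs`, `Stage`, `run_pipeline`, and the three placeholder stage functions are hypothetical names invented here, not the project's actual API, and each stage just records a string where a real model call would go.

```python
from dataclasses import dataclass
from pathlib import Path
from typing import Callable, List


@dataclass
class PresentationInputs:
    """The three inputs named in the description (hypothetical container)."""
    paper: Path            # the scientific paper
    speaker_image: Path    # photo of the speaker
    reference_audio: Path  # voice sample used as the audio reference


@dataclass
class Stage:
    """One pipeline step: reads the shared state dict and returns it updated."""
    name: str
    run: Callable[[dict], dict]


def run_pipeline(inputs: PresentationInputs, stages: List[Stage]) -> dict:
    """Run the stages in order, logging each stage name as it completes."""
    state = {"inputs": inputs, "log": []}
    for stage in stages:
        state = stage.run(state)
        state["log"].append(stage.name)
    return state


# Placeholder stages (assumptions): a real system would call models here.
def comprehend_paper(state: dict) -> dict:
    state["script"] = f"narration script for {state['inputs'].paper.name}"
    return state


def synthesize_speech(state: dict) -> dict:
    voice = state["inputs"].reference_audio.name
    state["narration"] = f"audio of '{state['script']}' in the voice of {voice}"
    return state


def animate_avatar(state: dict) -> dict:
    face = state["inputs"].speaker_image.name
    state["video"] = f"video of {face} speaking: {state['narration']}"
    return state


DEFAULT_STAGES = [
    Stage("comprehend_paper", comprehend_paper),
    Stage("synthesize_speech", synthesize_speech),
    Stage("animate_avatar", animate_avatar),
]

if __name__ == "__main__":
    inputs = PresentationInputs(
        Path("paper.pdf"), Path("speaker.png"), Path("voice.wav")
    )
    result = run_pipeline(inputs, DEFAULT_STAGES)
    print(result["video"])
```

Keeping each stage behind the same `dict -> dict` interface is one possible design choice: it lets a stage be swapped or reordered without touching the others, which suits a system assembled from independent components.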