← all repositories

zzw922cn/awesome-speech-recognition-speech-synthesis-papers

Curated collection of academic papers on automatic speech recognition, text-to-speech synthesis, voice conversion, and related audio generation techniques.

awesome-speech-recognition-speech-synthesis-papers
Velocity · 7d
+0.9
★ / day
Trend
steady
star history

This repository aggregates research papers on speech and audio processing including automatic speech recognition (ASR), speaker verification, voice conversion (VC), text-to-speech (TTS) synthesis, and text-to-audio generation. The collection spans classical approaches like hidden Markov models to modern deep learning methods using CNNs, RNNs, attention mechanisms, and diffusion models. It also includes papers on language modeling for speech tasks and music generation.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.