
EmulationAI/awesome-large-audio-models

Curated list of research papers, models, and resources on applying Large Language Models to audio, speech, and music processing tasks.

Velocity (7d): +0.7 ★ / day. Trend: steady.

This repository collects resources on Large Language Models applied to audio AI. It supplements an arXiv survey paper covering transformer-based architectures for audio processing. The collection covers automatic speech recognition, speech-to-text, music generation and analysis, and broader audio signal processing with LLMs, spanning foundational models for audio, speech LLMs, and the intersection of NLP with audio understanding.
