EmulationAI/awesome-large-audio-models
Curated list of research papers, models, and resources on applying Large Language Models to audio, speech, and music processing tasks.

This repository provides a comprehensive collection of resources on Large Language Models applied to audio AI. It supplements an arxiv survey paper covering transformer-based architectures for audio processing. The collection covers topics including automatic speech recognition, speech-to-text, music generation and analysis, and broader audio signal processing with LLMs. Topics span foundational models for audio, speech LLMs, and the intersection of NLP with audio understanding.