← all repositories

pytorch/audio

A PyTorch library for audio signal processing and machine learning, providing GPU-accelerated audio I/O, transforms, and dataset utilities.

audio
Velocity · 7d
+0.9
★ / day
Trend
steady
star history

torchaudio is the official PyTorch audio library, designed to process audio data for machine learning workflows. It offers GPU-accelerated operations, audio I/O with popular formats, and preprocessing transforms like spectrograms and MFCCs. The library includes built-in dataloaders for speech and audio datasets, plus speech-specific utilities like forced alignment for ASR training. Unlike general signal processing tools, it tightly integrates with PyTorch’s autograd system for end-to-end differentiable audio pipelines.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.