Is audio open source?

Yes — pytorch/audio is open source, released under the BSD-2-Clause license.

What language is audio written in?

pytorch/audio is primarily written in Python.

How popular is audio?

pytorch/audio has 2.9k stars on GitHub.

Where can I find audio?

pytorch/audio is on GitHub at https://github.com/pytorch/audio.

← all repositories

pytorch/audio

PyTorch's audio layer goes on a maintenance diet

TorchAudio is PyTorch's audio preprocessing layer, recently put on a maintenance diet to stay focused on ML.

★2.9k stars Python Image · Video · Audio Data Tooling

View on GitHub ↗ Homepage ↗

Not currently ranked — collecting fresh signals.

star history

What it does

TorchAudio provides audio transforms, dataset loaders, and compliance interfaces that operate entirely as PyTorch operations. That means spectrograms, mel-scale conversions, resampling, and MFCCs can run on a GPU and participate in the autograd graph. It is explicitly a machine-learning library, not a general-purpose signal-processing toolkit.

The interesting bit

The project recently entered a maintenance phase where maintainers deprecated and removed user-facing features from versions 2.8 and 2.9. The goal was to eliminate redundancies with the broader PyTorch ecosystem and shrink the scope to what the library actually does well: preparing audio data for ML pipelines.

Key highlights

All transforms—Spectrogram, MelSpectrogram, MFCC, Resample, and others—are native PyTorch ops, so they are GPU-accelerated and differentiable.
Includes dataloaders for common audio datasets and a forced_align speech-processing function.
Offers Kaldi compliance interfaces for spectrogram, fbank, and mfcc to ease migration from speech-recipe pipelines.
Pre-trained models are available, though each carries its own license (for example, the SquimSubjective model is CC-BY-NC 4.0).

Caveats

The library is in maintenance mode; active development has shifted to pruning rather than expanding capabilities, and some previously available features were removed in release 2.9.
It does not aim to be a general audio or signal-processing library, so if your work is outside ML training pipelines, the scope may feel narrow.

Verdict

Use this if you are already inside the PyTorch ecosystem and need audio preprocessing that behaves like every other tensor operation in your pipeline. If you are looking for a broad, general-purpose audio engineering framework, this is intentionally not it.

Frequently asked

What is pytorch/audio?: TorchAudio is PyTorch's audio preprocessing layer, recently put on a maintenance diet to stay focused on ML.
Is audio open source?: Yes — pytorch/audio is open source, released under the BSD-2-Clause license.
What language is audio written in?: pytorch/audio is primarily written in Python.
How popular is audio?: pytorch/audio has 2.9k stars on GitHub.
Where can I find audio?: pytorch/audio is on GitHub at https://github.com/pytorch/audio.