tsurumeso/vocal-remover
A PyTorch-based deep learning tool that separates vocals from instrumental tracks in audio files using U-Net convolutional networks.

Velocity · 7d
+0.7
★ / day
Trend
→steady
star history
This repository provides a deep neural network for extracting instrumental tracks from songs by separating vocal and non-vocal components. It uses a U-Net convolutional architecture adapted for audio spectrogram processing. The tool supports CPU and GPU inference, test-time augmentation, and post-processing options. Users can also train custom models on their own datasets of mixed audio and separated stems.