← all repositories

tsurumeso/vocal-remover

A PyTorch-based deep learning tool that separates vocals from instrumental tracks in audio files using U-Net convolutional networks.

1.7k stars Python Image · Video · Audio
vocal-remover
Velocity · 7d
+0.7
★ / day
Trend
steady
star history

This repository provides a deep neural network for extracting instrumental tracks from songs by separating vocal and non-vocal components. It uses a U-Net convolutional architecture adapted for audio spectrogram processing. The tool supports CPU and GPU inference, test-time augmentation, and post-processing options. Users can also train custom models on their own datasets of mixed audio and separated stems.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.