Singing Voice Separation with Deep U-Net Convolutional Networks

2017-10-27 · International Society for Music Information Retrieval 2017

Andreas Jansson, Eric Humphrey, Nicola Montecchio, Rachel Bittner, Aparna Kumar, Tillman Weyde

Code

github.com/tsurumeso/vocal-remover
PyTorch

Abstract

The decomposition of a music audio signal into its vocal and backing track components is analogous to image-to-image translation, where a mixed spectrogram is transformed into its constituent sources. We propose a novel application of the U-Net architecture, initially developed for medical imaging, to the task of source separation, given its proven capacity for recreating the fine, low-level detail required for high-quality audio reproduction. Through both quantitative evaluation and subjective assessment, experiments demonstrate that the proposed algorithm achieves state-of-the-art performance.
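
To make the spectrogram-to-spectrogram idea concrete, here is a minimal PyTorch sketch of a two-level U-Net applied to a magnitude spectrogram. It is an illustration under stated assumptions, not the authors' implementation. The class name `TinyUNet`, the layer count, the channel widths, and the 64 x 32 input size are hypothetical. The sketch also assumes the network predicts a soft mask in (0, 1) that scales the mixture, a detail the abstract itself does not state.

```python
import torch
import torch.nn as nn


class TinyUNet(nn.Module):
    """Hypothetical two-level U-Net that predicts a soft mask for a spectrogram."""

    def __init__(self, base_channels: int = 8):
        super().__init__()
        c = base_channels
        # Encoder: strided convolutions halve the frequency and time axes.
        self.enc1 = nn.Sequential(
            nn.Conv2d(1, c, kernel_size=5, stride=2, padding=2),
            nn.BatchNorm2d(c),
            nn.LeakyReLU(0.2),
        )
        self.enc2 = nn.Sequential(
            nn.Conv2d(c, 2 * c, kernel_size=5, stride=2, padding=2),
            nn.BatchNorm2d(2 * c),
            nn.LeakyReLU(0.2),
        )
        # Decoder: transposed convolutions double both axes again.
        self.dec2 = nn.Sequential(
            nn.ConvTranspose2d(2 * c, c, kernel_size=5, stride=2,
                               padding=2, output_padding=1),
            nn.BatchNorm2d(c),
            nn.ReLU(),
        )
        # This layer takes 2 * c channels: c from dec2 plus c from the enc1 skip.
        self.dec1 = nn.ConvTranspose2d(2 * c, 1, kernel_size=5, stride=2,
                                       padding=2, output_padding=1)

    def forward(self, mix: torch.Tensor) -> torch.Tensor:
        # mix: (batch, 1, freq_bins, frames), non-negative magnitudes,
        # with freq_bins and frames divisible by 4.
        e1 = self.enc1(mix)
        e2 = self.enc2(e1)
        d2 = self.dec2(e2)
        # Skip connection: concatenate same-resolution encoder features so the
        # decoder can recover fine detail lost in downsampling.
        d1 = self.dec1(torch.cat([d2, e1], dim=1))
        mask = torch.sigmoid(d1)  # soft mask in (0, 1), an assumption of this sketch
        return mask * mix         # estimated vocal magnitude spectrogram
```

Calling `TinyUNet().eval()` on a random tensor of shape `(2, 1, 64, 32)` returns an estimate of the same shape. Under the masking assumption, subtracting that estimate from the mixture gives a rough backing-track magnitude.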

Tasks

Speech Separation, Translation
