Voice Separation with an Unknown Number of Multiple Speakers

2020-02-29 · ICML 2020

Eliya Nachmani, Yossi Adi, Lior Wolf

Code

github.com/facebookresearch/svoice (official, PyTorch, ★ 1,318)
github.com/muhammad-ahmed-ghani/svoice_demo (PyTorch, ★ 37)
github.com/enk100/speaker_separation (no framework listed, ★ 14)
github.com/Mack189/gdprnn (MindSpore, ★ 3)

Abstract

We present a new method for separating a mixed audio sequence, in which multiple voices speak simultaneously. The new method employs gated neural networks that are trained to separate the voices at multiple processing steps, while maintaining the speaker in each output channel fixed. A different model is trained for every number of possible speakers, and the model with the largest number of speakers is employed to select the actual number of speakers in a given sample. Our method greatly outperforms the current state of the art, which, as we show, is not competitive for more than two speakers.
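
The speaker-count selection described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract only states that the model trained for the largest number of speakers is used to pick the actual count. The energy-threshold test for "active" channels, the function names (`channel_power`, `count_active_channels`, `separate_unknown_count`), and the `power_threshold` value are all assumptions made here for illustration, and the separators are stand-in callables rather than trained networks.

```python
from typing import Callable, Dict, List, Sequence

Signal = List[float]
# A separator maps a mixture to one output channel per speaker (hypothetical type).
Separator = Callable[[Signal], List[Signal]]


def channel_power(channel: Sequence[float]) -> float:
    """Mean squared amplitude of one output channel."""
    if not channel:
        return 0.0
    return sum(x * x for x in channel) / len(channel)


def count_active_channels(channels: Sequence[Sequence[float]],
                          power_threshold: float = 1e-4) -> int:
    """Count channels whose power exceeds the threshold (assumed activity test)."""
    return sum(1 for ch in channels if channel_power(ch) > power_threshold)


def separate_unknown_count(mixture: Signal,
                           models: Dict[int, Separator],
                           power_threshold: float = 1e-4) -> List[Signal]:
    """Use the largest model to estimate the speaker count, then re-separate.

    `models` maps a speaker count to the separator trained for that count.
    """
    available = sorted(models)
    largest = available[-1]

    # Step 1: run the model trained for the largest number of speakers.
    candidates = models[largest](mixture)

    # Step 2: channels that stay (near) silent are treated as unused.
    n_active = count_active_channels(candidates, power_threshold)

    # Step 3: pick the smallest available model that covers the active count.
    chosen = next((k for k in available if k >= n_active), largest)
    return models[chosen](mixture)
```

In this sketch, a mixture for which the largest model leaves all but two channels silent would be re-separated by the two-speaker model, assuming one exists in `models`.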

Tasks

Speech Separation

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
WHAMR!	VSUNOS	SI-SDRi	12.2	—	Unverified
WSJ0-2mix	Gated DualPathRNN	SI-SDRi	20.12	—	Unverified
WSJ0-3mix	Gated DualPathRNN	SI-SDRi	16.85	—	Unverified
WSJ0-4mix	Gated DualPathRNN	SI-SDRi	12.88	—	Unverified
WSJ0-5mix	Gated DualPathRNN	SI-SDRi	10.56	—	Unverified
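
For readers unfamiliar with the metric in the table, SI-SDRi is the improvement in scale-invariant signal-to-distortion ratio (in dB) of a separated output over the unprocessed mixture. The sketch below follows the commonly used definition, with zero-mean normalization of both signals; it is not taken from the paper's evaluation code, and the function names and the `eps` stabilizer are assumptions made here.

```python
import math
from typing import List, Sequence


def _zero_mean(x: Sequence[float]) -> List[float]:
    """Remove the mean so the metric ignores constant offsets."""
    mean = sum(x) / len(x)
    return [v - mean for v in x]


def si_sdr(estimate: Sequence[float], reference: Sequence[float],
           eps: float = 1e-12) -> float:
    """Scale-invariant SDR in dB between an estimate and a clean reference."""
    est = _zero_mean(estimate)
    ref = _zero_mean(reference)

    # Project the estimate onto the reference to get the target component.
    dot = sum(e * r for e, r in zip(est, ref))
    ref_energy = sum(r * r for r in ref)
    scale = dot / (ref_energy + eps)
    target = [scale * r for r in ref]

    # Everything not explained by the scaled reference counts as distortion.
    noise = [e - t for e, t in zip(est, target)]
    target_energy = sum(t * t for t in target)
    noise_energy = sum(n * n for n in noise)
    return 10.0 * math.log10((target_energy + eps) / (noise_energy + eps))


def si_sdr_improvement(estimate: Sequence[float], mixture: Sequence[float],
                       reference: Sequence[float]) -> float:
    """SI-SDRi: gain of the separated estimate over the raw mixture, in dB."""
    return si_sdr(estimate, reference) - si_sdr(mixture, reference)
```

For multi-speaker outputs, the per-speaker values are typically averaged after matching output channels to reference speakers; that matching step is omitted here.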
