Music Source Separation Based on a Lightweight Deep Learning Framework (DTTNET: DUAL-PATH TFC-TDF UNET)

2023-09-15Code Available2· sign in to hype

Junyu Chen, Susmitha Vekkot, Pancham Shukla

Code Available — Be the first to reproduce this paper.

Code

github.com/junyuchen-cjy/dttnet-pytorch
OfficialIn paperpytorch★ 104
github.com/FaceOnLive/Spleeter-Android-iOS
tf★ 224

Abstract

Music source separation (MSS) aims to extract 'vocals', 'drums', 'bass' and 'other' tracks from a piece of mixed music. While deep learning methods have shown impressive results, there is a trend toward larger models. In our paper, we introduce a novel and lightweight architecture called DTTNet, which is based on Dual-Path Module and Time-Frequency Convolutions Time-Distributed Fully-connected UNet (TFC-TDF UNet). DTTNet achieves 10.12 dB cSDR on 'vocals' compared to 10.01 dB reported for Bandsplit RNN (BSRNN) but with 86.7% fewer parameters. We also assess pattern-specific performance and model generalization for intricate audio patterns.

Tasks

Music Source Separation

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
MUSDB18-HQ	Dual-Path TFC-TDF UNet (DTTNet)	SDR (avg)	8.15	—	Unverified

Music Source Separation Based on a Lightweight Deep Learning Framework (DTTNET: DUAL-PATH TFC-TDF UNET)

Code

Abstract

Tasks

Benchmark Results

Reproductions