SOTAVerified

Music Source Separation

Music source separation is the task of decomposing music into its constitutive components, e. g., yielding separated stems for the vocals, bass, and drums.

( Image credit: SigSep )

Papers

Showing 76100 of 107 papers

TitleStatusHype
D3Net: Densely connected multidilated DenseNet for music source separationCode0
A frugal approach to music source separation0
Mixing-Specific Data Augmentation Techniques for Improved Blind Violin/Piano Source SeparationCode1
Multitask learning for instrument activation aware music source separation0
Depthwise Separable Convolutions Versus Recurrent Neural Networks for Monaural Singing Voice Separation0
Multi-channel U-Net for Music Source SeparationCode1
Voice and accompaniment separation in music using self-attention convolutional neural network0
Unsupervised Interpretable Representation Learning for Singing Voice SeparationCode1
Meta-learning Extractors for Music Source SeparationCode1
Time-Domain Audio Source Separation Based on Wave-U-Net Combined with Discrete Wavelet TransformCode1
Music Source Separation in the Waveform DomainCode0
Spleeter: A Fast And State-of-the Art Music Source Separation Tool With Pre-trained ModelsCode1
Bootstrapping deep music separation from primitive auditory grouping principles0
Cutting Music Source Separation Some Slakh: A Dataset to Study the Impact of Training Data Quality and Quantity0
Sams-Net: A Sliced Attention-based Neural Network for Music Source SeparationCode0
Open-Unmix - A Reference Implementation for Music Source SeparationCode0
Demucs: Deep Extractor for Music Sources with extra unlabeled data remixedCode0
Dilated Convolution with Dilated GRU for Music Source Separation0
Examining the Mapping Functions of Denoising Autoencoders in Singing Voice Separation0
Spectrogram Feature Losses for Music Source Separation0
Semi-Supervised Monaural Singing Voice Separation With a Masking Network Trained on Synthetic MixturesCode0
Class-conditional embeddings for music source separation0
End-to-End Sound Source Separation Conditioned On Instrument Labels0
End-to-end music source separation: is it possible in the waveform domain?Code1
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech SeparationCode3
Show:102550
← PrevPage 4 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Sparse HT Demucs (fine tuned)SDR (avg)9.2Unverified
2Hybrid Transformer Demucs (f.t.)SDR (avg)9Unverified
3Band-Split RNN (semi-sup.)SDR (avg)8.97Unverified
4TFC-TDF-UNet (v3)SDR (avg)8.34Unverified
5Band-Split RNNSDR (avg)8.23Unverified
6Hybrid DemucsSDR (avg)7.72Unverified
7KUIELab-MDX-NetSDR (avg)7.54Unverified
8CDE-HTCNSDR (avg)6.89Unverified
9Attentive-MultiResUNetSDR (avg)6.81Unverified
10DEMUCS (extra)SDR (avg)6.79Unverified
#ModelMetricClaimedVerifiedStatus
1BS-RoFormer (L=12, OA)SDR (avg)11.99Unverified
2BS-RoFormer (L=6, OA)SDR (avg)9.8Unverified
3SCNet-largeSDR (avg)9.69Unverified
4Sparse HT Demucs (fine tuned)SDR (avg)9.2Unverified
5Hybrid Transformer Demucs (f.t.)SDR (avg)9Unverified
6SCNetSDR (avg)9Unverified
7Band-Split RNN (semi-sup.)SDR (avg)8.97Unverified
8TFC-TDF-UNet (v3)SDR (avg)8.34Unverified
9Band-Split RNNSDR (avg)8.24Unverified
10Dual-Path TFC-TDF UNet (DTTNet)SDR (avg)8.15Unverified
#ModelMetricClaimedVerifiedStatus
1DiCoSe (Deterministic)SI-SDRi (Bass)20.04Unverified
2LQ-VAE + Scalable TransformerSDR (bass)7.42Unverified