SOTAVerified

Music Source Separation

Music source separation is the task of decomposing music into its constitutive components, e. g., yielding separated stems for the vocals, bass, and drums.

( Image credit: SigSep )

Papers

Showing 5175 of 107 papers

TitleStatusHype
A Hands-on Comparison of DNNs for Dialog Separation Using Transfer Learning from Music Source Separation0
An Ensemble Approach to Music Source Separation: A Comparative Analysis of Conventional and Hierarchical Stem Separation0
A Study of Transfer Learning in Music Source Separation0
Bootstrapping deep music separation from primitive auditory grouping principles0
Class-conditional embeddings for music source separation0
Contrastive Learning based Deep Latent Masking for Music Source Separation0
Cutting Music Source Separation Some Slakh: A Dataset to Study the Impact of Training Data Quality and Quantity0
Deep Clustering and Conventional Networks for Music Separation: Stronger Together0
Demucs: Deep Extractor for Music Sources with extra unlabeled data remixed0
Denoising Auto-encoder with Recurrent Skip Connections and Residual Regression for Music Source Separation0
Densely connected multidilated convolutional networks for dense prediction tasks0
Depthwise Separable Convolutions Versus Recurrent Neural Networks for Monaural Singing Voice Separation0
Dilated Convolution with Dilated GRU for Music Source Separation0
End-to-End Sound Source Separation Conditioned On Instrument Labels0
Examining the Mapping Functions of Denoising Autoencoders in Singing Voice Separation0
Fast accuracy estimation of deep learning based multi-class musical source separation0
Feature-informed Latent Space Regularization for Music Source Separation0
Hierarchic Temporal Convolutional Network With Cross-Domain Encoder for Music Source Separation0
HTMD-Net: A Hybrid Masking-Denoising Approach to Time-Domain Monaural Singing Voice Separation0
Hybrid Spectrogram and Waveform Source Separation0
Hybrid Y-Net Architecture for Singing Voice Separation0
Improving Real-Time Music Accompaniment Separation with MMDenseNet0
Is MixIT Really Unsuitable for Correlated Sources? Exploring MixIT for Unsupervised Pre-training in Music Source Separation0
Jointist: Simultaneous Improvement of Multi-instrument Transcription and Music Source Separation via Joint Training0
MAJL: A Model-Agnostic Joint Learning Framework for Music Source Separation and Pitch Estimation0
Show:102550
← PrevPage 3 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Sparse HT Demucs (fine tuned)SDR (avg)9.2Unverified
2Hybrid Transformer Demucs (f.t.)SDR (avg)9Unverified
3Band-Split RNN (semi-sup.)SDR (avg)8.97Unverified
4TFC-TDF-UNet (v3)SDR (avg)8.34Unverified
5Band-Split RNNSDR (avg)8.23Unverified
6Hybrid DemucsSDR (avg)7.72Unverified
7KUIELab-MDX-NetSDR (avg)7.54Unverified
8CDE-HTCNSDR (avg)6.89Unverified
9Attentive-MultiResUNetSDR (avg)6.81Unverified
10DEMUCS (extra)SDR (avg)6.79Unverified
#ModelMetricClaimedVerifiedStatus
1BS-RoFormer (L=12, OA)SDR (avg)11.99Unverified
2BS-RoFormer (L=6, OA)SDR (avg)9.8Unverified
3SCNet-largeSDR (avg)9.69Unverified
4Sparse HT Demucs (fine tuned)SDR (avg)9.2Unverified
5Hybrid Transformer Demucs (f.t.)SDR (avg)9Unverified
6SCNetSDR (avg)9Unverified
7Band-Split RNN (semi-sup.)SDR (avg)8.97Unverified
8TFC-TDF-UNet (v3)SDR (avg)8.34Unverified
9Band-Split RNNSDR (avg)8.24Unverified
10Dual-Path TFC-TDF UNet (DTTNet)SDR (avg)8.15Unverified
#ModelMetricClaimedVerifiedStatus
1DiCoSe (Deterministic)SI-SDRi (Bass)20.04Unverified
2LQ-VAE + Scalable TransformerSDR (bass)7.42Unverified