SOTAVerified

Music Source Separation

Music source separation is the task of decomposing music into its constitutive components, e. g., yielding separated stems for the vocals, bass, and drums.

( Image credit: SigSep )

Papers

Showing 2650 of 107 papers

TitleStatusHype
End-to-end music source separation: is it possible in the waveform domain?Code1
Danna-Sep: Unite to separate them allCode1
VocaLiST: An Audio-Visual Synchronisation Model for Lips and VoicesCode1
Unsupervised Music Source Separation Using Differentiable Parametric Source ModelsCode1
Music Source RestorationCode1
Unsupervised Interpretable Representation Learning for Singing Voice SeparationCode1
Multi-Task Audio Source SeparationCode1
A cappella: Audio-visual Singing Voice SeparationCode1
Multi-channel U-Net for Music Source SeparationCode1
Mixing-Specific Data Augmentation Techniques for Improved Blind Violin/Piano Source SeparationCode1
CWS-PResUNet: Music Source Separation with Channel-wise Subband Phase-aware ResUNetCode1
An Efficient Short-Time Discrete Cosine Transform and Attentive MultiResUNet Framework for Music Source SeparationCode1
Music Mixing Style Transfer: A Contrastive Learning Approach to Disentangle Audio EffectsCode1
Music Demixing Challenge 2021Code1
Unsupervised Source Separation via Bayesian Inference in the Latent DomainCode1
Adversarial Semi-Supervised Audio Source Separation applied to Singing Voice ExtractionCode0
Sams-Net: A Sliced Attention-based Neural Network for Music Source SeparationCode0
Semi-Supervised Monaural Singing Voice Separation With a Masking Network Trained on Synthetic MixturesCode0
Machine Perceptual Quality: Evaluating the Impact of Severe Lossy Compression on Audio and Image ModelsCode0
Primal-Dual Algorithms for Non-negative Matrix Factorization with the Kullback-Leibler DivergenceCode0
Low Latency Time Domain Multichannel Speech and Music Source SeparationCode0
Learned Compression for Compressed LearningCode0
Sampling-Frequency-Independent Audio Source Separation Using Convolution Layer Based on Impulse Invariant MethodCode0
Music source separation conditioned on 3D point cloudsCode0
Open-Unmix - A Reference Implementation for Music Source SeparationCode0
Show:102550
← PrevPage 2 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Sparse HT Demucs (fine tuned)SDR (avg)9.2Unverified
2Hybrid Transformer Demucs (f.t.)SDR (avg)9Unverified
3Band-Split RNN (semi-sup.)SDR (avg)8.97Unverified
4TFC-TDF-UNet (v3)SDR (avg)8.34Unverified
5Band-Split RNNSDR (avg)8.23Unverified
6Hybrid DemucsSDR (avg)7.72Unverified
7KUIELab-MDX-NetSDR (avg)7.54Unverified
8CDE-HTCNSDR (avg)6.89Unverified
9Attentive-MultiResUNetSDR (avg)6.81Unverified
10DEMUCS (extra)SDR (avg)6.79Unverified
#ModelMetricClaimedVerifiedStatus
1BS-RoFormer (L=12, OA)SDR (avg)11.99Unverified
2BS-RoFormer (L=6, OA)SDR (avg)9.8Unverified
3SCNet-largeSDR (avg)9.69Unverified
4Sparse HT Demucs (fine tuned)SDR (avg)9.2Unverified
5Hybrid Transformer Demucs (f.t.)SDR (avg)9Unverified
6SCNetSDR (avg)9Unverified
7Band-Split RNN (semi-sup.)SDR (avg)8.97Unverified
8TFC-TDF-UNet (v3)SDR (avg)8.34Unverified
9Band-Split RNNSDR (avg)8.24Unverified
10Dual-Path TFC-TDF UNet (DTTNet)SDR (avg)8.15Unverified
#ModelMetricClaimedVerifiedStatus
1DiCoSe (Deterministic)SI-SDRi (Bass)20.04Unverified
2LQ-VAE + Scalable TransformerSDR (bass)7.42Unverified