SOTAVerified

Music Source Separation

Music source separation is the task of decomposing music into its constitutive components, e. g., yielding separated stems for the vocals, bass, and drums.

( Image credit: SigSep )

Papers

Showing 51100 of 107 papers

TitleStatusHype
Improving Real-Time Music Accompaniment Separation with MMDenseNet0
Why does music source separation benefit from cacophony?0
Real-time Low-latency Music Source Separation using Hybrid Spectrogram-TasNet0
Resource-constrained stereo singing voice cancellation0
Machine Perceptual Quality: Evaluating the Impact of Severe Lossy Compression on Audio and Image ModelsCode0
Subnetwork-to-go: Elastic Neural Network with Dynamic Training and Customizable Inference0
Pre-trained Spatial Priors on Multichannel NMF for Music Source Separation0
MBTFNet: Multi-Band Temporal-Frequency Neural Network For Singing Voice Enhancement0
Contrastive Learning based Deep Latent Masking for Music Source Separation0
Self-refining of Pseudo Labels for Music Source Separation with Noisy Labeled Data0
Pac-HuBERT: Self-Supervised Music Source Separation via Primitive Auditory Clustering and Hidden-Unit BERT0
Hybrid Y-Net Architecture for Singing Voice Separation0
Jointist: Simultaneous Improvement of Multi-instrument Transcription and Music Source Separation via Joint Training0
Multi-scale temporal-frequency attention for music source separation0
Music Separation Enhancement with Generative Modeling0
Hierarchic Temporal Convolutional Network With Cross-Domain Encoder for Music Source Separation0
0/1 Deep Neural Networks via Block Coordinate Descent0
Low Latency Time Domain Multichannel Speech and Music Source SeparationCode0
Feature-informed Latent Space Regularization for Music Source Separation0
On loss functions and evaluation metrics for music source separation0
SpaIn-Net: Spatially-Informed Stereophonic Music Source Separation0
Distortion Audio Effects: Learning How to Recover the Clean Signal0
Upsampling layers for music source separation0
Hybrid Spectrogram and Waveform Source SeparationCode0
A Hands-on Comparison of DNNs for Dialog Separation Using Transfer Learning from Music Source Separation0
Sampling-Frequency-Independent Audio Source Separation Using Convolution Layer Based on Impulse Invariant MethodCode0
HTMD-Net: A Hybrid Masking-Denoising Approach to Time-Domain Monaural Singing Voice Separation0
Music source separation conditioned on 3D point cloudsCode0
Adversarial Unsupervised Domain Adaptation for Harmonic-Percussive Source Separation0
Source Separation and Depthwise Separable Convolutions for Computer Audition0
Densely connected multidilated convolutional networks for dense prediction tasksCode0
A Study of Transfer Learning in Music Source Separation0
Transcription Is All You Need: Learning to Separate Musical Mixtures with Score as Supervision0
Fast accuracy estimation of deep learning based multi-class musical source separation0
D3Net: Densely connected multidilated DenseNet for music source separationCode0
A frugal approach to music source separation0
Multitask learning for instrument activation aware music source separation0
Depthwise Separable Convolutions Versus Recurrent Neural Networks for Monaural Singing Voice Separation0
Voice and accompaniment separation in music using self-attention convolutional neural network0
Music Source Separation in the Waveform DomainCode0
Bootstrapping deep music separation from primitive auditory grouping principles0
Cutting Music Source Separation Some Slakh: A Dataset to Study the Impact of Training Data Quality and Quantity0
Sams-Net: A Sliced Attention-based Neural Network for Music Source SeparationCode0
Open-Unmix - A Reference Implementation for Music Source SeparationCode0
Demucs: Deep Extractor for Music Sources with extra unlabeled data remixedCode0
Dilated Convolution with Dilated GRU for Music Source Separation0
Examining the Mapping Functions of Denoising Autoencoders in Singing Voice Separation0
Spectrogram Feature Losses for Music Source Separation0
Semi-Supervised Monaural Singing Voice Separation With a Masking Network Trained on Synthetic MixturesCode0
Class-conditional embeddings for music source separation0
Show:102550
← PrevPage 2 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Sparse HT Demucs (fine tuned)SDR (avg)9.2Unverified
2Hybrid Transformer Demucs (f.t.)SDR (avg)9Unverified
3Band-Split RNN (semi-sup.)SDR (avg)8.97Unverified
4TFC-TDF-UNet (v3)SDR (avg)8.34Unverified
5Band-Split RNNSDR (avg)8.23Unverified
6Hybrid DemucsSDR (avg)7.72Unverified
7KUIELab-MDX-NetSDR (avg)7.54Unverified
8CDE-HTCNSDR (avg)6.89Unverified
9Attentive-MultiResUNetSDR (avg)6.81Unverified
10DEMUCS (extra)SDR (avg)6.79Unverified
#ModelMetricClaimedVerifiedStatus
1BS-RoFormer (L=12, OA)SDR (avg)11.99Unverified
2BS-RoFormer (L=6, OA)SDR (avg)9.8Unverified
3SCNet-largeSDR (avg)9.69Unverified
4Sparse HT Demucs (fine tuned)SDR (avg)9.2Unverified
5SCNetSDR (avg)9Unverified
6Hybrid Transformer Demucs (f.t.)SDR (avg)9Unverified
7Band-Split RNN (semi-sup.)SDR (avg)8.97Unverified
8TFC-TDF-UNet (v3)SDR (avg)8.34Unverified
9Band-Split RNNSDR (avg)8.24Unverified
10Dual-Path TFC-TDF UNet (DTTNet)SDR (avg)8.15Unverified
#ModelMetricClaimedVerifiedStatus
1DiCoSe (Deterministic)SI-SDRi (Bass)20.04Unverified
2LQ-VAE + Scalable TransformerSDR (bass)7.42Unverified