SOTAVerified

Music Source Separation

Music source separation is the task of decomposing music into its constitutive components, e. g., yielding separated stems for the vocals, bass, and drums.

( Image credit: SigSep )

Papers

Showing 76100 of 107 papers

TitleStatusHype
Feature-informed Latent Space Regularization for Music Source Separation0
Hierarchic Temporal Convolutional Network With Cross-Domain Encoder for Music Source Separation0
HTMD-Net: A Hybrid Masking-Denoising Approach to Time-Domain Monaural Singing Voice Separation0
Hybrid Y-Net Architecture for Singing Voice Separation0
Improving Real-Time Music Accompaniment Separation with MMDenseNet0
Is MixIT Really Unsuitable for Correlated Sources? Exploring MixIT for Unsupervised Pre-training in Music Source Separation0
Jointist: Simultaneous Improvement of Multi-instrument Transcription and Music Source Separation via Joint Training0
MAJL: A Model-Agnostic Joint Learning Framework for Music Source Separation and Pitch Estimation0
MBTFNet: Multi-Band Temporal-Frequency Neural Network For Singing Voice Enhancement0
Multi-scale temporal-frequency attention for music source separation0
Multitask learning for instrument activation aware music source separation0
Music Foundation Model as Generic Booster for Music Downstream Tasks0
Music Separation Enhancement with Generative Modeling0
0/1 Deep Neural Networks via Block Coordinate Descent0
Pac-HuBERT: Self-Supervised Music Source Separation via Primitive Auditory Clustering and Hidden-Unit BERT0
Pre-trained Spatial Priors on Multichannel NMF for Music Source Separation0
Real-time Low-latency Music Source Separation using Hybrid Spectrogram-TasNet0
Distortion Audio Effects: Learning How to Recover the Clean Signal0
Resource-constrained stereo singing voice cancellation0
Sanidha: A Studio Quality Multi-Modal Dataset for Carnatic Music0
Self-refining of Pseudo Labels for Music Source Separation with Noisy Labeled Data0
Separate This, and All of these Things Around It: Music Source Separation via Hyperellipsoidal Queries0
Solving Copyright Infringement on Short Video Platforms: Novel Datasets and an Audio Restoration Deep Learning Pipeline0
Source Separation and Depthwise Separable Convolutions for Computer Audition0
SpaIn-Net: Spatially-Informed Stereophonic Music Source Separation0
Show:102550
← PrevPage 4 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Sparse HT Demucs (fine tuned)SDR (avg)9.2Unverified
2Hybrid Transformer Demucs (f.t.)SDR (avg)9Unverified
3Band-Split RNN (semi-sup.)SDR (avg)8.97Unverified
4TFC-TDF-UNet (v3)SDR (avg)8.34Unverified
5Band-Split RNNSDR (avg)8.23Unverified
6Hybrid DemucsSDR (avg)7.72Unverified
7KUIELab-MDX-NetSDR (avg)7.54Unverified
8CDE-HTCNSDR (avg)6.89Unverified
9Attentive-MultiResUNetSDR (avg)6.81Unverified
10DEMUCS (extra)SDR (avg)6.79Unverified
#ModelMetricClaimedVerifiedStatus
1BS-RoFormer (L=12, OA)SDR (avg)11.99Unverified
2BS-RoFormer (L=6, OA)SDR (avg)9.8Unverified
3SCNet-largeSDR (avg)9.69Unverified
4Sparse HT Demucs (fine tuned)SDR (avg)9.2Unverified
5Hybrid Transformer Demucs (f.t.)SDR (avg)9Unverified
6SCNetSDR (avg)9Unverified
7Band-Split RNN (semi-sup.)SDR (avg)8.97Unverified
8TFC-TDF-UNet (v3)SDR (avg)8.34Unverified
9Band-Split RNNSDR (avg)8.24Unverified
10Dual-Path TFC-TDF UNet (DTTNet)SDR (avg)8.15Unverified
#ModelMetricClaimedVerifiedStatus
1DiCoSe (Deterministic)SI-SDRi (Bass)20.04Unverified
2LQ-VAE + Scalable TransformerSDR (bass)7.42Unverified