SOTAVerified

Audio Source Separation

Audio Source Separation is the process of separating a mixture (e.g. a pop band recording) into isolated sounds from individual sources (e.g. just the lead vocals).

Source: Model selection for deep audio source separation via clustering analysis

Papers

Showing 125 of 112 papers

TitleStatusHype
Separate Anything You DescribeCode3
Training-Free Multi-Step Audio Source SeparationCode2
A Stem-Agnostic Single-Decoder System for Music Source Separation Beyond Four StemsCode2
Leveraging LLM and Text-Queried Separation for Noise-Robust Sound Event DetectionCode1
Exploring Text-Queried Sound Event Detection with Audio Source SeparationCode1
Unsupervised Composable Representations for AudioCode1
Facing the Music: Tackling Singing Voice Separation in Cinematic Audio Source SeparationCode1
Remastering Divide and Remaster: A Cinematic Audio Source Separation Dataset with Multilingual SupportCode1
Spectral Mapping of Singing Voices: U-Net-Assisted Vocal SegmentationCode1
A Generalized Bandsplit Neural Network for Cinematic Audio Source SeparationCode1
Deep Audio Waveform PriorCode1
Separate What You Describe: Language-Queried Audio Source SeparationCode1
Unsupervised Music Source Separation Using Differentiable Parametric Source ModelsCode1
Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled DataCode1
Hybrid Neural Networks for On-device Directional HearingCode1
Transfer Learning with Jukebox for Music Source SeparationCode1
Unsupervised Source Separation By Steering Pretrained Music ModelsCode1
The Cocktail Fork Problem: Three-Stem Audio Separation for Real-World SoundtracksCode1
Unsupervised Source Separation via Bayesian Inference in the Latent DomainCode1
Multi-Task Audio Source SeparationCode1
Parallel and Flexible Sampling from Autoregressive Models via Langevin DynamicsCode1
Differentiable Model Compression via Pseudo Quantization NoiseCode1
Compute and memory efficient universal sound source separationCode1
Directional Sparse Filtering using Weighted Lehmer Mean for Blind Separation of Unbalanced Speech MixturesCode1
Unified Gradient Reweighting for Model Biasing with Applications to Source SeparationCode1
Show:102550
← PrevPage 1 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ST-SED-SEPSDR10.55Unverified
2Co-SeparationSDR4.26Unverified
#ModelMetricClaimedVerifiedStatus
1Co-SeparationSAR11.3Unverified