SOTAVerified

Audio Source Separation

Audio Source Separation is the process of separating a mixture (e.g. a pop band recording) into isolated sounds from individual sources (e.g. just the lead vocals).

Source: Model selection for deep audio source separation via clustering analysis

Papers

Showing 150 of 112 papers

TitleStatusHype
Separate Anything You DescribeCode3
Training-Free Multi-Step Audio Source SeparationCode2
A Stem-Agnostic Single-Decoder System for Music Source Separation Beyond Four StemsCode2
Leveraging LLM and Text-Queried Separation for Noise-Robust Sound Event DetectionCode1
Exploring Text-Queried Sound Event Detection with Audio Source SeparationCode1
Unsupervised Composable Representations for AudioCode1
Facing the Music: Tackling Singing Voice Separation in Cinematic Audio Source SeparationCode1
Remastering Divide and Remaster: A Cinematic Audio Source Separation Dataset with Multilingual SupportCode1
Spectral Mapping of Singing Voices: U-Net-Assisted Vocal SegmentationCode1
A Generalized Bandsplit Neural Network for Cinematic Audio Source SeparationCode1
Deep Audio Waveform PriorCode1
Separate What You Describe: Language-Queried Audio Source SeparationCode1
Unsupervised Music Source Separation Using Differentiable Parametric Source ModelsCode1
Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled DataCode1
Hybrid Neural Networks for On-device Directional HearingCode1
Transfer Learning with Jukebox for Music Source SeparationCode1
Unsupervised Source Separation By Steering Pretrained Music ModelsCode1
The Cocktail Fork Problem: Three-Stem Audio Separation for Real-World SoundtracksCode1
Unsupervised Source Separation via Bayesian Inference in the Latent DomainCode1
Multi-Task Audio Source SeparationCode1
Parallel and Flexible Sampling from Autoregressive Models via Langevin DynamicsCode1
Differentiable Model Compression via Pseudo Quantization NoiseCode1
Compute and memory efficient universal sound source separationCode1
Directional Sparse Filtering using Weighted Lehmer Mean for Blind Separation of Unbalanced Speech MixturesCode1
Unified Gradient Reweighting for Model Biasing with Applications to Source SeparationCode1
The Cone of Silence: Speech Separation by LocalizationCode1
AutoClip: Adaptive Gradient Clipping for Source Separation NetworksCode1
Sudo rm -rf: Efficient Networks for Universal Audio Source SeparationCode1
OtoWorld: Towards Learning to Separate by Learning to MoveCode1
Unsupervised Audio Source Separation using Generative PriorsCode1
Time-Domain Audio Source Separation Based on Wave-U-Net Combined with Discrete Wavelet TransformCode1
Wave-U-Net: A Multi-Scale Neural Network for End-to-End Audio Source SeparationCode1
Towards Reliable Objective Evaluation Metrics for Generative Singing Voice Separation ModelsCode0
DGMO: Training-Free Audio Source Separation through Diffusion-Guided Mask Optimization0
ZeroSep: Separate Anything in Audio with Zero Training0
Text-Queried Audio Source Separation via Hierarchical Modeling0
Score Distillation Sampling for Audio: Source Separation, Synthesis, and Beyond0
Automatic Identification of Samples in Hip-Hop Music via Multi-Loss Training and an Artificial Dataset0
Study of the Performance of CEEMDAN in Underdetermined Speech Separation0
Task-Aware Unified Source Separation0
Semantic Grouping Network for Audio Source Separation0
Performance Improvement of Language-Queried Audio Source Separation Based on Caption Augmentation From Large Language Models for DCASE Challenge 2024 Task 90
Low algorithmic delay implementation of convolutional beamformer for online joint source separation and dereverberation0
Gull: A Generative Multifunctional Audio Codec0
Mixture of Dynamical Variational Autoencoders for Multi-Source Trajectory Modeling and Separation0
GASS: Generalizing Audio Source Separation with Large-scale Data0
Language-Guided Audio-Visual Source Separation via Trimodal Consistency0
Separate And Diffuse: Using a Pretrained Diffusion Model for Improving Source Separation0
Tackling the Cocktail Fork Problem for Separation and Transcription of Real-World Soundtracks0
Hyperbolic Audio Source Separation0
Show:102550
← PrevPage 1 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ST-SED-SEPSDR10.55Unverified
2Co-SeparationSDR4.26Unverified
#ModelMetricClaimedVerifiedStatus
1Co-SeparationSAR11.3Unverified