SOTAVerified

Audio Source Separation

Audio Source Separation is the process of separating a mixture (e.g. a pop band recording) into isolated sounds from individual sources (e.g. just the lead vocals).

Source: Model selection for deep audio source separation via clustering analysis

Papers

Showing 150 of 112 papers

TitleStatusHype
Towards Reliable Objective Evaluation Metrics for Generative Singing Voice Separation ModelsCode0
DGMO: Training-Free Audio Source Separation through Diffusion-Guided Mask Optimization0
ZeroSep: Separate Anything in Audio with Zero Training0
Text-Queried Audio Source Separation via Hierarchical Modeling0
Training-Free Multi-Step Audio Source SeparationCode2
Score Distillation Sampling for Audio: Source Separation, Synthesis, and Beyond0
Automatic Identification of Samples in Hip-Hop Music via Multi-Loss Training and an Artificial Dataset0
Study of the Performance of CEEMDAN in Underdetermined Speech Separation0
Leveraging LLM and Text-Queried Separation for Noise-Robust Sound Event DetectionCode1
Task-Aware Unified Source Separation0
Exploring Text-Queried Sound Event Detection with Audio Source SeparationCode1
Unsupervised Composable Representations for AudioCode1
Facing the Music: Tackling Singing Voice Separation in Cinematic Audio Source SeparationCode1
Remastering Divide and Remaster: A Cinematic Audio Source Separation Dataset with Multilingual SupportCode1
Semantic Grouping Network for Audio Source Separation0
A Stem-Agnostic Single-Decoder System for Music Source Separation Beyond Four StemsCode2
Performance Improvement of Language-Queried Audio Source Separation Based on Caption Augmentation From Large Language Models for DCASE Challenge 2024 Task 90
Low algorithmic delay implementation of convolutional beamformer for online joint source separation and dereverberation0
Spectral Mapping of Singing Voices: U-Net-Assisted Vocal SegmentationCode1
Gull: A Generative Multifunctional Audio Codec0
Mixture of Dynamical Variational Autoencoders for Multi-Source Trajectory Modeling and Separation0
GASS: Generalizing Audio Source Separation with Large-scale Data0
A Generalized Bandsplit Neural Network for Cinematic Audio Source SeparationCode1
Separate Anything You DescribeCode3
Language-Guided Audio-Visual Source Separation via Trimodal Consistency0
Separate And Diffuse: Using a Pretrained Diffusion Model for Improving Source Separation0
Tackling the Cocktail Fork Problem for Separation and Transcription of Real-World Soundtracks0
Hyperbolic Audio Source Separation0
Differentiable Dictionary Search: Integrating Linear Mixing with Deep Non-Linear Modelling for Audio Source Separation0
Learning Audio-Visual Dynamics Using Scene Graphs for Audio Source SeparationCode0
Deep Audio Waveform PriorCode1
Hierarchic Temporal Convolutional Network With Cross-Domain Encoder for Music Source Separation0
Sampling Frequency Independent Dialogue Separation0
SepIt: Approaching a Single Channel Speech Separation Bound0
Separate What You Describe: Language-Queried Audio Source SeparationCode1
On loss functions and evaluation metrics for music source separation0
Differentiable Digital Signal Processing Mixture Model for Synthesis Parameter Extraction from Mixture of Harmonic Sounds0
Unsupervised Music Source Separation Using Differentiable Parametric Source ModelsCode1
Fish sounds: towards the evaluation of marine acoustic biodiversity through data-driven audio source separation0
Self-Supervised Beat Tracking in Musical Signals with Polyphonic Contrastive Learning0
Zero-shot Audio Source Separation through Query-based Learningfrom Weakly-labeled Data0
Zero-shot Audio Source Separation through Query-based Learning from Weakly-labeled DataCode1
Hybrid Neural Networks for On-device Directional HearingCode1
Transfer Learning with Jukebox for Music Source SeparationCode1
Reduction of Subjective Listening Effort for TV Broadcast Signals with Recurrent Neural Networks0
Unsupervised Source Separation By Steering Pretrained Music ModelsCode1
The Cocktail Fork Problem: Three-Stem Audio Separation for Real-World SoundtracksCode1
Unsupervised Source Separation via Bayesian Inference in the Latent DomainCode1
Visual Scene Graphs for Audio Source SeparationCode0
Multi-Task Audio Source SeparationCode1
Show:102550
← PrevPage 1 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ST-SED-SEPSDR10.55Unverified
2Co-SeparationSDR4.26Unverified
#ModelMetricClaimedVerifiedStatus
1Co-SeparationSAR11.3Unverified