SOTAVerified

Target Sound Extraction

Target Sound Extraction is the task of extracting a sound corresponding to a given class from an audio mixture. The audio mixture may contain background noise with a relatively low amplitude compared to the foreground mixture components. The choice of the sound class is provided as input to the model in the form of a string, an integer, or a one-hot encoding of the sound class.
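As a rough illustration of the three query forms mentioned above, the following sketch normalizes a string, an integer index, or an existing one-hot vector into a single one-hot representation. The class list `CLASSES` and the function name `encode_query` are hypothetical and chosen only for this example; real datasets and models define their own label sets and conditioning interfaces.

```python
# Hypothetical label set, for illustration only.
CLASSES = ["dog_bark", "siren", "speech", "door_knock"]


def encode_query(query, classes=CLASSES):
    """Return a one-hot list for a class given as a name, an index, or a one-hot vector."""
    n = len(classes)
    if isinstance(query, str):
        # String query: look up the class name (raises ValueError if unknown).
        index = classes.index(query)
    elif isinstance(query, int):
        # Integer query: interpret as a class index.
        index = query
        if not 0 <= index < n:
            raise ValueError(f"class index {index} out of range 0..{n - 1}")
    else:
        # Assume a sequence that is already one-hot; validate it and pass it through.
        vec = [float(v) for v in query]
        if len(vec) != n or sorted(vec) != [0.0] * (n - 1) + [1.0]:
            raise ValueError("query is not a valid one-hot vector")
        return vec
    one_hot = [0.0] * n
    one_hot[index] = 1.0
    return one_hot


# All three forms select the same class.
by_name = encode_query("siren")
by_index = encode_query(1)
by_vector = encode_query([0, 1, 0, 0])
```

In a trained system this vector (or an embedding derived from it) would typically be fed to the separation network alongside the mixture; how that conditioning is wired in depends on the specific model.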

Papers

Showing 11–16 of 16 papers

Title | Status | Hype
Leveraging Audio-Only Data for Text-Queried Target Sound Extraction | | 0
Language-Queried Target Sound Extraction Without Parallel Training Data | | 0
Few-shot learning of new sound classes for target sound extraction | | 0
SoundBeam: Target sound extraction conditioned on sound-class labels and enrollment clues for increased performance and continuous learning | | 0
SoundSculpt: Direction and Semantics Driven Ambisonic Target Sound Extraction | | 0
CATSE: A Context-Aware Framework for Causal Target Sound Extraction | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | CLAPSep | SDRi | 10.08 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CLAPSep | SDRi | 9.29 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Waveformer | SI-SNRi | 9.43 | | Unverified
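
The SI-SNRi metric listed above is commonly understood as the improvement in scale-invariant signal-to-noise ratio of the extracted signal over the unprocessed mixture, in dB. The sketch below shows one common formulation in plain Python; the function names and the `eps` stabilizer are assumptions for illustration, and the papers listed here may compute their reported numbers with different conventions or tooling. SDRi is a related but differently defined metric and is not covered by this sketch.

```python
import math


def si_snr(estimate, target, eps=1e-8):
    """Scale-invariant SNR in dB between an estimated and a reference signal."""
    n = len(target)
    # Remove the mean so that a constant offset does not affect the score.
    mean_e = sum(estimate) / n
    mean_t = sum(target) / n
    e = [x - mean_e for x in estimate]
    t = [x - mean_t for x in target]
    # Project the estimate onto the target to get the scaled target component.
    scale = sum(a * b for a, b in zip(e, t)) / (sum(b * b for b in t) + eps)
    projection = [scale * b for b in t]
    residual = [a - p for a, p in zip(e, projection)]
    signal_energy = sum(p * p for p in projection)
    noise_energy = sum(r * r for r in residual) + eps
    return 10.0 * math.log10(signal_energy / noise_energy + eps)


def si_snr_improvement(estimate, mixture, target):
    """SI-SNRi: SI-SNR of the estimate minus SI-SNR of the input mixture."""
    return si_snr(estimate, target) - si_snr(mixture, target)


# Toy signals: the interference is orthogonal to the target.
target = [1.0, -1.0, 1.0, -1.0]
interference = [0.5, 0.5, -0.5, -0.5]
mixture = [t + i for t, i in zip(target, interference)]
# A hypothetical extractor output that suppresses the interference by a factor of 10.
estimate = [t + 0.1 * i for t, i in zip(target, interference)]
improvement = si_snr_improvement(estimate, mixture, target)
```

Because the estimate is projected onto the target before the ratio is taken, rescaling the estimate leaves the score essentially unchanged, which is what makes the metric scale-invariant.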