SOTAVerified

Target Sound Extraction

Target Sound Extraction is the task of extracting a sound corresponding to a given class from an audio mixture. The audio mixture may contain background noise with a relatively low amplitude compared to the foreground mixture components. The choice of the sound class is provided as input to the model in form of a string, integer, or a one-hot encoding of the sound class.

Papers

Showing 1116 of 16 papers

TitleStatusHype
Semantic Hearing: Programming Acoustic Scenes with Binaural HearablesCode1
DPM-TSE: A Diffusion Probabilistic Model for Target Sound ExtractionCode1
Target Sound Extraction with Variable Cross-modality CluesCode1
Real-Time Target Sound ExtractionCode2
SoundBeam: Target sound extraction conditioned on sound-class labels and enrollment clues for increased performance and continuous learning0
Few-shot learning of new sound classes for target sound extraction0
Show:102550
← PrevPage 2 of 2Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CLAPSepSDRi10.08Unverified
#ModelMetricClaimedVerifiedStatus
1CLAPSepSDRi9.29Unverified
#ModelMetricClaimedVerifiedStatus
1WaveformerSI-SNRi9.43Unverified