SOTAVerified

Target Speaker Extraction

Extract the speech of a specified target speaker from a multi-speaker conversation.

Papers

Showing 26-50 of 55 papers

| Title | Status | Hype |
| --- | --- | --- |
| Audio-Visual Target Speaker Extraction with Reverse Selective Auditory Attention | Code | 1 |
| Enhancing Real-World Active Speaker Detection with Multi-Modal Extraction Pre-Training | | 0 |
| Target Speaker Extraction by Directly Exploiting Contextual Information in the Time-Frequency Domain | | 0 |
| Listening to Multi-talker Conversations: Modular and End-to-end Perspectives | | 0 |
| A Single Speech Enhancement Model Unifying Dereverberation, Denoising, Speaker Counting, Separation, and Extraction | | 0 |
| Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction | Code | 1 |
| Conditional Diffusion Model for Target Speaker Extraction | | 0 |
| RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual Speech Separation | Code | 1 |
| The Multimodal Information Based Speech Processing (MISP) 2023 Challenge: Audio-Visual Target Speaker Extraction | | 0 |
| SpeechX: Neural Codec Language Model as a Versatile Speech Transformer | | 0 |
| Beamformer-Guided Target Speaker Extraction | | 0 |
| Multi-Channel Target Speaker Extraction with Refinement: The WavLab Submission to the Second Clarity Enhancement Challenge | | 0 |
| Improving Target Speaker Extraction with Sparse LDA-transformed Speaker Embeddings | | 0 |
| GPU-accelerated Guided Source Separation for Meeting Transcription | Code | 1 |
| ExARN: self-attending RNN for target speaker extraction | | 0 |
| Adapting self-supervised models to multi-talker speech recognition using speaker embeddings | | 0 |
| ImagineNET: Target Speaker Extraction with Intermittent Visual Cue through Embedding Inpainting | Code | 0 |
| Exploiting spatial information with the informed complex-valued spatial autoencoder for target speaker extraction | | 0 |
| Semi-supervised Time Domain Target Speaker Extraction with Attention | | 0 |
| Speaker-conditioning Single-channel Target Speaker Extraction using Conformer-based Architectures | | 0 |
| A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker Extraction | Code | 1 |
| Coarse-to-Fine Recursive Speech Separation for Unknown Number of Speakers | | 0 |
| L-SpEx: Localized Target Speaker Extraction | Code | 1 |
| New Insights on Target Speaker Extraction | | 0 |
| Selective Listening by Synchronizing Speech with Lips | Code | 1 |

No leaderboard results yet.