SOTAVerified

Target Speaker Extraction

Extract the dialogue content of the specified target in a multi-person dialogue.

Papers

Showing 110 of 55 papers

TitleStatusHype
Incorporating Linguistic Constraints from External Knowledge Source for Audio-Visual Target Speech Extraction0
M3ANet: Multi-scale and Multi-Modal Alignment Network for Brain-Assisted Target Speaker ExtractionCode0
FlowTSE: Target Speaker Extraction with Flow Matching0
Listen to Extract: Onset-Prompted Target Speaker Extraction0
LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language ModelsCode1
C^2AV-TSE: Context and Confidence-aware Audio Visual Target Speaker Extraction0
Target Speaker Extraction through Comparing Noisy Positive and Negative Audio Enrollments0
Metis: A Foundation Speech Generation Model with Masked Generative Pre-trainingCode9
AnyEnhance: A Unified Generative Model with Prompt-Guidance and Self-Critic for Voice Enhancement0
Universal Speaker Embedding Free Target Speaker Extraction and Personal Voice Activity Detection0
Show:102550
← PrevPage 1 of 6Next →

No leaderboard results yet.