SOTAVerified

Target Speaker Extraction

Extract the dialogue content of the specified target in a multi-person dialogue.

Papers

Showing 110 of 55 papers

TitleStatusHype
Metis: A Foundation Speech Generation Model with Masked Generative Pre-trainingCode9
Multi-Level Speaker Representation for Target Speaker ExtractionCode3
WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker ExtractionCode3
TSELM: Target Speaker Extraction using Discrete Tokens and Language ModelsCode2
AV-CrossNet: an Audiovisual Complex Spectral Mapping Network for Speech Separation By Leveraging Narrow- and Cross-Band ModelingCode1
L-SpEx: Localized Target Speaker ExtractionCode1
Muse: Multi-modal target speaker extraction with visual cuesCode1
Audio-Visual Target Speaker Extraction with Reverse Selective Auditory AttentionCode1
GPU-accelerated Guided Source Separation for Meeting TranscriptionCode1
A Hybrid Continuity Loss to Reduce Over-Suppression for Time-domain Target Speaker ExtractionCode1
Show:102550
← PrevPage 1 of 6Next →

No leaderboard results yet.