SOTAVerified

Speech Extraction

Papers

Showing 1-25 of 48 papers

| Title | Status | Hype |
| --- | --- | --- |
| Look Once to Hear: Target Speech Hearing with Noisy Examples | Code | 4 |
| SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline | Code | 3 |
| DPCCN: Densely-Connected Pyramid Complex Convolutional Network for Robust Speech Separation And Extraction | Code | 1 |
| Neural Target Speech Extraction: An Overview | Code | 1 |
| Improving speaker discrimination of target speech extraction with time-domain SpeakerBeam | Code | 1 |
| On the Role of Spatial, Spectral, and Temporal Processing for DNN-based Non-linear Multi-channel Speech Enhancement | Code | 1 |
| Contextual Speech Extraction: Leveraging Textual History as an Implicit Cue for Target Speech Extraction | | 0 |
| Deep Learning-Based Joint Control of Acoustic Echo Cancellation, Beamforming and Postfiltering | | 0 |
| DDTSE: Discriminative Diffusion Model for Target Speech Extraction | | 0 |
| Distance Based Single-Channel Target Speech Extraction | | 0 |
| Dual-Path Cross-Modal Attention for better Audio-Visual Speech Extraction | | 0 |
| Enhanced Neural Beamformer with Spatial Information for Target Speech Extraction | | 0 |
| Enhancing Intelligibility for Generative Target Speech Extraction via Joint Optimization with Target Speaker ASR | | 0 |
| Generative Speech Foundation Model Pretraining for High-Quality Speech Extraction and Restoration | | 0 |
| Improving Channel Decorrelation for Multi-Channel Target Speech Extraction | | 0 |
| Incorporating Linguistic Constraints from External Knowledge Source for Audio-Visual Target Speech Extraction | | 0 |
| Inter-Speaker Relative Cues for Text-Guided Target Speech Extraction | | 0 |
| Investigation of Speaker Representation for Target-Speaker Speech Processing | | 0 |
| Knowledge Distillation for Neural Transducer-based Target-Speaker ASR: Exploiting Parallel Mixture/Single-Talker Speech Data | | 0 |
| Learning-based personal speech enhancement for teleconferencing by exploiting spatial-spectral features | | 0 |
| Listen only to me! How well can target speech extraction handle false alarms? | | 0 |
| Multi-talker ASR for an unknown number of sources: Joint training of source counting, separation and ASR | | 0 |
| X-SepFormer: End-to-end Speaker Extraction Network with Explicit Optimization on Speaker Confusion | | 0 |
| Attention-based scaling adaptation for target speech extraction | | 0 |
| Audio-Visual Speech Enhancement With Selective Off-Screen Speech Extraction | | 0 |

No leaderboard results yet.