SOTAVerified

Speech Extraction

Papers

Showing 1-25 of 48 papers

| Title | Status | Hype |
| --- | --- | --- |
| Incorporating Linguistic Constraints from External Knowledge Source for Audio-Visual Target Speech Extraction | | 0 |
| Inter-Speaker Relative Cues for Text-Guided Target Speech Extraction | | 0 |
| SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline | Code | 3 |
| Single-Channel Target Speech Extraction Utilizing Distance and Room Clues | | 0 |
| SonicSieve: Bringing Directional Speech Extraction to Smartphones Using Acoustic Microstructures | | 0 |
| Contextual Speech Extraction: Leveraging Textual History as an Implicit Cue for Target Speech Extraction | | 0 |
| Enhancing Intelligibility for Generative Target Speech Extraction via Joint Optimization with Target Speaker ASR | | 0 |
| Beyond Speaker Identity: Text Guided Target Speech Extraction | Code | 0 |
| Distance Based Single-Channel Target Speech Extraction | | 0 |
| Investigation of Speaker Representation for Target-Speaker Speech Processing | | 0 |
| Generative Speech Foundation Model Pretraining for High-Quality Speech Extraction and Restoration | Code | 0 |
| Look Once to Hear: Target Speech Hearing with Noisy Examples | Code | 4 |
| Separate in the Speech Chain: Cross-Modal Conditional Audio-Visual Target Speech Extraction | | 0 |
| Probing Self-supervised Learning Models with Target Speech Extraction | | 0 |
| Target Speech Extraction with Pre-trained Self-supervised Learning Models | | 0 |
| Self-Supervised Disentangled Representation Learning for Robust Target Speech Extraction | | 0 |
| Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction | | 0 |
| AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data | | 0 |
| DDTSE: Discriminative Diffusion Model for Target Speech Extraction | | 0 |
| Target Speech Extraction with Conditional Diffusion Model | | 0 |
| Enhanced Neural Beamformer with Spatial Information for Target Speech Extraction | | 0 |
| Audio-Visual Speech Enhancement With Selective Off-Screen Speech Extraction | | 0 |
| Knowledge Distillation for Neural Transducer-based Target-Speaker ASR: Exploiting Parallel Mixture/Single-Talker Speech Data | | 0 |
| X-SepFormer: End-to-end Speaker Extraction Network with Explicit Optimization on Speaker Confusion | | 0 |
| Neural Target Speech Extraction: An Overview | Code | 1 |

No leaderboard results yet.