SOTAVerified

Speech Extraction

Papers

Showing 148 of 48 papers

TitleStatusHype
Look Once to Hear: Target Speech Hearing with Noisy ExamplesCode4
SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative PipelineCode3
Neural Target Speech Extraction: An OverviewCode1
On the Role of Spatial, Spectral, and Temporal Processing for DNN-based Non-linear Multi-channel Speech EnhancementCode1
DPCCN: Densely-Connected Pyramid Complex Convolutional Network for Robust Speech Separation And ExtractionCode1
Improving speaker discrimination of target speech extraction with time-domain SpeakerBeamCode1
Contextual Speech Extraction: Leveraging Textual History as an Implicit Cue for Target Speech Extraction0
Deep Learning-Based Joint Control of Acoustic Echo Cancellation, Beamforming and Postfiltering0
DDTSE: Discriminative Diffusion Model for Target Speech Extraction0
Distance Based Single-Channel Target Speech Extraction0
Dual-Path Cross-Modal Attention for better Audio-Visual Speech Extraction0
Enhanced Neural Beamformer with Spatial Information for Target Speech Extraction0
Enhancing Intelligibility for Generative Target Speech Extraction via Joint Optimization with Target Speaker ASR0
Improving Channel Decorrelation for Multi-Channel Target Speech Extraction0
Incorporating Linguistic Constraints from External Knowledge Source for Audio-Visual Target Speech Extraction0
Inter-Speaker Relative Cues for Text-Guided Target Speech Extraction0
Investigation of Speaker Representation for Target-Speaker Speech Processing0
Knowledge Distillation for Neural Transducer-based Target-Speaker ASR: Exploiting Parallel Mixture/Single-Talker Speech Data0
Learning-based personal speech enhancement for teleconferencing by exploiting spatial-spectral features0
Listen only to me! How well can target speech extraction handle false alarms?0
X-SepFormer: End-to-end Speaker Extraction Network with Explicit Optimization on Speaker Confusion0
Attention-based scaling adaptation for target speech extraction0
Audio-Visual Speech Enhancement With Selective Off-Screen Speech Extraction0
AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data0
Target Speech Extraction: Independent Vector Extraction Guided by Supervised Speaker Identification0
ConceptBeam: Concept Driven Target Speech Extraction0
Target Speech Extraction with Conditional Diffusion Model0
Target Speech Extraction with Pre-trained Self-supervised Learning Models0
Time-Domain Speech Extraction with Spatial Information and Multi Speaker Conditioning Mechanism0
All-neural beamformer for continuous speech separation0
Multi-talker ASR for an unknown number of sources: Joint training of source counting, separation and ASR0
Probing Self-supervised Learning Models with Target Speech Extraction0
Algorithm for Independent Vector Extraction Based on Semi-Time-Variant Mixing Model0
Quotations, Coreference Resolution, and Sentiment Annotations in Croatian News Articles: An Exploratory Study0
Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction0
Self-Supervised Disentangled Representation Learning for Robust Target Speech Extraction0
Separate in the Speech Chain: Cross-Modal Conditional Audio-Visual Target Speech Extraction0
Simultaneous Speech Extraction for Multiple Target Speakers under the Meeting Scenarios0
Single-Channel Target Speech Extraction Utilizing Distance and Room Clues0
SonicSieve: Bringing Directional Speech Extraction to Smartphones Using Acoustic Microstructures0
Speaker activity driven neural speech extraction0
Speaker Separation Using Speaker Inventories and Estimated Speech0
Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations0
Streaming Target-Speaker ASR with Neural Transducer0
Generative Speech Foundation Model Pretraining for High-Quality Speech Extraction and RestorationCode0
Analysis of impact of emotions on target speech extraction and speech separationCode0
Target Speech Extraction Based on Blind Source Separation and X-vector-based Speaker Selection Trained with Data AugmentationCode0
Beyond Speaker Identity: Text Guided Target Speech ExtractionCode0
Show:102550

No leaderboard results yet.