| Look Once to Hear: Target Speech Hearing with Noisy Examples | May 10, 2024 | CPUSpeech Extraction | CodeCode Available | 4 |
| SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline | May 25, 2025 | Speech ExtractionSpeech Separation | CodeCode Available | 3 |
| Neural Target Speech Extraction: An Overview | Jan 31, 2023 | Speech Extraction | CodeCode Available | 1 |
| On the Role of Spatial, Spectral, and Temporal Processing for DNN-based Non-linear Multi-channel Speech Enhancement | Jun 22, 2022 | Speech EnhancementSpeech Extraction | CodeCode Available | 1 |
| DPCCN: Densely-Connected Pyramid Complex Convolutional Network for Robust Speech Separation And Extraction | Dec 27, 2021 | Speech ExtractionSpeech Separation | CodeCode Available | 1 |
| Improving speaker discrimination of target speech extraction with time-domain SpeakerBeam | Jan 23, 2020 | Speaker IdentificationSpeech Extraction | CodeCode Available | 1 |
| Contextual Speech Extraction: Leveraging Textual History as an Implicit Cue for Target Speech Extraction | Mar 11, 2025 | Speech Extraction | —Unverified | 0 |
| Deep Learning-Based Joint Control of Acoustic Echo Cancellation, Beamforming and Postfiltering | Mar 3, 2022 | Acoustic echo cancellationSpeech Extraction | —Unverified | 0 |
| DDTSE: Discriminative Diffusion Model for Target Speech Extraction | Sep 25, 2023 | modelSpeech Enhancement | —Unverified | 0 |
| Distance Based Single-Channel Target Speech Extraction | Dec 28, 2024 | Speech Extraction | —Unverified | 0 |
| Dual-Path Cross-Modal Attention for better Audio-Visual Speech Extraction | Jul 9, 2022 | Speech ExtractionSpeech Separation | —Unverified | 0 |
| Enhanced Neural Beamformer with Spatial Information for Target Speech Extraction | Jun 28, 2023 | Dimensionality ReductionSpeech Extraction | —Unverified | 0 |
| Enhancing Intelligibility for Generative Target Speech Extraction via Joint Optimization with Target Speaker ASR | Jan 24, 2025 | Speech Extraction | —Unverified | 0 |
| Improving Channel Decorrelation for Multi-Channel Target Speech Extraction | Jun 6, 2021 | Speech Extraction | —Unverified | 0 |
| Incorporating Linguistic Constraints from External Knowledge Source for Audio-Visual Target Speech Extraction | Jun 11, 2025 | Speech ExtractionTarget Speaker Extraction | —Unverified | 0 |
| Inter-Speaker Relative Cues for Text-Guided Target Speech Extraction | Jun 2, 2025 | AttributeSpeech Extraction | —Unverified | 0 |
| Investigation of Speaker Representation for Target-Speaker Speech Processing | Oct 15, 2024 | Action DetectionActivity Detection | —Unverified | 0 |
| Knowledge Distillation for Neural Transducer-based Target-Speaker ASR: Exploiting Parallel Mixture/Single-Talker Speech Data | May 25, 2023 | Knowledge DistillationSpeech Extraction | —Unverified | 0 |
| Learning-based personal speech enhancement for teleconferencing by exploiting spatial-spectral features | Dec 10, 2021 | Speech EnhancementSpeech Extraction | —Unverified | 0 |
| Listen only to me! How well can target speech extraction handle false alarms? | Apr 11, 2022 | Speaker IdentificationSpeaker Verification | —Unverified | 0 |
| X-SepFormer: End-to-end Speaker Extraction Network with Explicit Optimization on Speaker Confusion | Mar 9, 2023 | Speech Extraction | —Unverified | 0 |
| Attention-based scaling adaptation for target speech extraction | Oct 19, 2020 | Speech Extraction | —Unverified | 0 |
| Audio-Visual Speech Enhancement With Selective Off-Screen Speech Extraction | Jun 10, 2023 | Computational EfficiencySpeech Enhancement | —Unverified | 0 |
| AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data | Sep 25, 2023 | Automatic Speech RecognitionSpeech Enhancement | —Unverified | 0 |
| Target Speech Extraction: Independent Vector Extraction Guided by Supervised Speaker Identification | Nov 5, 2021 | Speaker IdentificationSpeech Extraction | —Unverified | 0 |