| Look Once to Hear: Target Speech Hearing with Noisy Examples | May 10, 2024 | CPUSpeech Extraction | CodeCode Available | 4 | 5 |
| SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline | May 25, 2025 | Speech ExtractionSpeech Separation | CodeCode Available | 3 | 5 |
| Neural Target Speech Extraction: An Overview | Jan 31, 2023 | Speech Extraction | CodeCode Available | 1 | 5 |
| On the Role of Spatial, Spectral, and Temporal Processing for DNN-based Non-linear Multi-channel Speech Enhancement | Jun 22, 2022 | Speech EnhancementSpeech Extraction | CodeCode Available | 1 | 5 |
| DPCCN: Densely-Connected Pyramid Complex Convolutional Network for Robust Speech Separation And Extraction | Dec 27, 2021 | Speech ExtractionSpeech Separation | CodeCode Available | 1 | 5 |
| Improving speaker discrimination of target speech extraction with time-domain SpeakerBeam | Jan 23, 2020 | Speaker IdentificationSpeech Extraction | CodeCode Available | 1 | 5 |
| Beyond Speaker Identity: Text Guided Target Speech Extraction | Jan 15, 2025 | Speech ExtractionSpeech Separation | CodeCode Available | 0 | 5 |
| Analysis of impact of emotions on target speech extraction and speech separation | Aug 15, 2022 | Speaker VerificationSpeech Extraction | CodeCode Available | 0 | 5 |
| Generative Speech Foundation Model Pretraining for High-Quality Speech Extraction and Restoration | Sep 24, 2024 | Bandwidth ExtensionDenoising | CodeCode Available | 0 | 5 |
| Target Speech Extraction Based on Blind Source Separation and X-vector-based Speaker Selection Trained with Data Augmentation | May 16, 2020 | blind source separationData Augmentation | CodeCode Available | 0 | 5 |
| Dual-Path Cross-Modal Attention for better Audio-Visual Speech Extraction | Jul 9, 2022 | Speech ExtractionSpeech Separation | —Unverified | 0 | 0 |
| Enhanced Neural Beamformer with Spatial Information for Target Speech Extraction | Jun 28, 2023 | Dimensionality ReductionSpeech Extraction | —Unverified | 0 | 0 |
| Enhancing Intelligibility for Generative Target Speech Extraction via Joint Optimization with Target Speaker ASR | Jan 24, 2025 | Speech Extraction | —Unverified | 0 | 0 |
| Improving Channel Decorrelation for Multi-Channel Target Speech Extraction | Jun 6, 2021 | Speech Extraction | —Unverified | 0 | 0 |
| Incorporating Linguistic Constraints from External Knowledge Source for Audio-Visual Target Speech Extraction | Jun 11, 2025 | Speech ExtractionTarget Speaker Extraction | —Unverified | 0 | 0 |
| Inter-Speaker Relative Cues for Text-Guided Target Speech Extraction | Jun 2, 2025 | AttributeSpeech Extraction | —Unverified | 0 | 0 |
| Investigation of Speaker Representation for Target-Speaker Speech Processing | Oct 15, 2024 | Action DetectionActivity Detection | —Unverified | 0 | 0 |
| Knowledge Distillation for Neural Transducer-based Target-Speaker ASR: Exploiting Parallel Mixture/Single-Talker Speech Data | May 25, 2023 | Knowledge DistillationSpeech Extraction | —Unverified | 0 | 0 |
| Learning-based personal speech enhancement for teleconferencing by exploiting spatial-spectral features | Dec 10, 2021 | Speech EnhancementSpeech Extraction | —Unverified | 0 | 0 |
| Listen only to me! How well can target speech extraction handle false alarms? | Apr 11, 2022 | Speaker IdentificationSpeaker Verification | —Unverified | 0 | 0 |
| Multi-talker ASR for an unknown number of sources: Joint training of source counting, separation and ASR | Jun 4, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Probing Self-supervised Learning Models with Target Speech Extraction | Feb 17, 2024 | Self-Supervised LearningSpeaker Identification | —Unverified | 0 | 0 |
| Algorithm for Independent Vector Extraction Based on Semi-Time-Variant Mixing Model | Oct 22, 2019 | Speech Extraction | —Unverified | 0 | 0 |
| Quotations, Coreference Resolution, and Sentiment Annotations in Croatian News Articles: An Exploratory Study | Dec 14, 2022 | Articlescoreference-resolution | —Unverified | 0 | 0 |
| Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction | Oct 30, 2023 | Speaker SeparationSpeech Enhancement | —Unverified | 0 | 0 |