| Look Once to Hear: Target Speech Hearing with Noisy Examples | May 10, 2024 | CPUSpeech Extraction | CodeCode Available | 4 |
| SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline | May 25, 2025 | Speech ExtractionSpeech Separation | CodeCode Available | 3 |
| Neural Target Speech Extraction: An Overview | Jan 31, 2023 | Speech Extraction | CodeCode Available | 1 |
| On the Role of Spatial, Spectral, and Temporal Processing for DNN-based Non-linear Multi-channel Speech Enhancement | Jun 22, 2022 | Speech EnhancementSpeech Extraction | CodeCode Available | 1 |
| DPCCN: Densely-Connected Pyramid Complex Convolutional Network for Robust Speech Separation And Extraction | Dec 27, 2021 | Speech ExtractionSpeech Separation | CodeCode Available | 1 |
| Improving speaker discrimination of target speech extraction with time-domain SpeakerBeam | Jan 23, 2020 | Speaker IdentificationSpeech Extraction | CodeCode Available | 1 |
| Incorporating Linguistic Constraints from External Knowledge Source for Audio-Visual Target Speech Extraction | Jun 11, 2025 | Speech ExtractionTarget Speaker Extraction | —Unverified | 0 |
| Inter-Speaker Relative Cues for Text-Guided Target Speech Extraction | Jun 2, 2025 | AttributeSpeech Extraction | —Unverified | 0 |
| Single-Channel Target Speech Extraction Utilizing Distance and Room Clues | May 20, 2025 | Speech ExtractionSpeech Separation | —Unverified | 0 |
| SonicSieve: Bringing Directional Speech Extraction to Smartphones Using Acoustic Microstructures | Apr 15, 2025 | Speech Extraction | —Unverified | 0 |
| Contextual Speech Extraction: Leveraging Textual History as an Implicit Cue for Target Speech Extraction | Mar 11, 2025 | Speech Extraction | —Unverified | 0 |
| Enhancing Intelligibility for Generative Target Speech Extraction via Joint Optimization with Target Speaker ASR | Jan 24, 2025 | Speech Extraction | —Unverified | 0 |
| Beyond Speaker Identity: Text Guided Target Speech Extraction | Jan 15, 2025 | Speech ExtractionSpeech Separation | CodeCode Available | 0 |
| Distance Based Single-Channel Target Speech Extraction | Dec 28, 2024 | Speech Extraction | —Unverified | 0 |
| Investigation of Speaker Representation for Target-Speaker Speech Processing | Oct 15, 2024 | Action DetectionActivity Detection | —Unverified | 0 |
| Generative Speech Foundation Model Pretraining for High-Quality Speech Extraction and Restoration | Sep 24, 2024 | Bandwidth ExtensionDenoising | CodeCode Available | 0 |
| Separate in the Speech Chain: Cross-Modal Conditional Audio-Visual Target Speech Extraction | Apr 19, 2024 | Speech Extraction | —Unverified | 0 |
| Probing Self-supervised Learning Models with Target Speech Extraction | Feb 17, 2024 | Self-Supervised LearningSpeaker Identification | —Unverified | 0 |
| Target Speech Extraction with Pre-trained Self-supervised Learning Models | Feb 17, 2024 | Self-Supervised LearningSpeech Extraction | —Unverified | 0 |
| Self-Supervised Disentangled Representation Learning for Robust Target Speech Extraction | Dec 16, 2023 | DisentanglementRepresentation Learning | —Unverified | 0 |
| Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction | Oct 30, 2023 | Speaker SeparationSpeech Enhancement | —Unverified | 0 |
| DDTSE: Discriminative Diffusion Model for Target Speech Extraction | Sep 25, 2023 | modelSpeech Enhancement | —Unverified | 0 |
| AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data | Sep 25, 2023 | Automatic Speech RecognitionSpeech Enhancement | —Unverified | 0 |
| Target Speech Extraction with Conditional Diffusion Model | Aug 8, 2023 | Denoisingmodel | —Unverified | 0 |
| Enhanced Neural Beamformer with Spatial Information for Target Speech Extraction | Jun 28, 2023 | Dimensionality ReductionSpeech Extraction | —Unverified | 0 |
| Audio-Visual Speech Enhancement With Selective Off-Screen Speech Extraction | Jun 10, 2023 | Computational EfficiencySpeech Enhancement | —Unverified | 0 |
| Knowledge Distillation for Neural Transducer-based Target-Speaker ASR: Exploiting Parallel Mixture/Single-Talker Speech Data | May 25, 2023 | Knowledge DistillationSpeech Extraction | —Unverified | 0 |
| X-SepFormer: End-to-end Speaker Extraction Network with Explicit Optimization on Speaker Confusion | Mar 9, 2023 | Speech Extraction | —Unverified | 0 |
| Quotations, Coreference Resolution, and Sentiment Annotations in Croatian News Articles: An Exploratory Study | Dec 14, 2022 | Articlescoreference-resolution | —Unverified | 0 |
| Streaming Target-Speaker ASR with Neural Transducer | Sep 9, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Analysis of impact of emotions on target speech extraction and speech separation | Aug 15, 2022 | Speaker VerificationSpeech Extraction | CodeCode Available | 0 |
| ConceptBeam: Concept Driven Target Speech Extraction | Jul 25, 2022 | Metric LearningSpeech Extraction | —Unverified | 0 |
| Dual-Path Cross-Modal Attention for better Audio-Visual Speech Extraction | Jul 9, 2022 | Speech ExtractionSpeech Separation | —Unverified | 0 |
| Simultaneous Speech Extraction for Multiple Target Speakers under the Meeting Scenarios | Jun 17, 2022 | Action DetectionActivity Detection | —Unverified | 0 |
| Strategies to Improve Robustness of Target Speech Extraction to Enrollment Variations | Jun 16, 2022 | Speaker IdentificationSpeech Extraction | —Unverified | 0 |
| Listen only to me! How well can target speech extraction handle false alarms? | Apr 11, 2022 | Speaker IdentificationSpeaker Verification | —Unverified | 0 |
| Deep Learning-Based Joint Control of Acoustic Echo Cancellation, Beamforming and Postfiltering | Mar 3, 2022 | Acoustic echo cancellationSpeech Extraction | —Unverified | 0 |
| Learning-based personal speech enhancement for teleconferencing by exploiting spatial-spectral features | Dec 10, 2021 | Speech EnhancementSpeech Extraction | —Unverified | 0 |
| Target Speech Extraction: Independent Vector Extraction Guided by Supervised Speaker Identification | Nov 5, 2021 | Speaker IdentificationSpeech Extraction | —Unverified | 0 |
| All-neural beamformer for continuous speech separation | Oct 13, 2021 | AllAutomatic Speech Recognition | —Unverified | 0 |
| Improving Channel Decorrelation for Multi-Channel Target Speech Extraction | Jun 6, 2021 | Speech Extraction | —Unverified | 0 |
| Time-Domain Speech Extraction with Spatial Information and Multi Speaker Conditioning Mechanism | Feb 7, 2021 | Speech Extractionspeech-recognition | —Unverified | 0 |
| Speaker activity driven neural speech extraction | Jan 14, 2021 | Speech Extraction | —Unverified | 0 |
| Speaker Separation Using Speaker Inventories and Estimated Speech | Oct 20, 2020 | Speaker SeparationSpeech Extraction | —Unverified | 0 |
| Attention-based scaling adaptation for target speech extraction | Oct 19, 2020 | Speech Extraction | —Unverified | 0 |
| Multi-talker ASR for an unknown number of sources: Joint training of source counting, separation and ASR | Jun 4, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Target Speech Extraction Based on Blind Source Separation and X-vector-based Speaker Selection Trained with Data Augmentation | May 16, 2020 | blind source separationData Augmentation | CodeCode Available | 0 |
| Algorithm for Independent Vector Extraction Based on Semi-Time-Variant Mixing Model | Oct 22, 2019 | Speech Extraction | —Unverified | 0 |