| Incorporating Linguistic Constraints from External Knowledge Source for Audio-Visual Target Speech Extraction | Jun 11, 2025 | Speech Extraction, Target Speaker Extraction | Unverified | 0 |
| Inter-Speaker Relative Cues for Text-Guided Target Speech Extraction | Jun 2, 2025 | Attribute, Speech Extraction | Unverified | 0 |
| SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline | May 25, 2025 | Speech Extraction, Speech Separation | Code Available | 3 |
| Single-Channel Target Speech Extraction Utilizing Distance and Room Clues | May 20, 2025 | Speech Extraction, Speech Separation | Unverified | 0 |
| SonicSieve: Bringing Directional Speech Extraction to Smartphones Using Acoustic Microstructures | Apr 15, 2025 | Speech Extraction | Unverified | 0 |
| Contextual Speech Extraction: Leveraging Textual History as an Implicit Cue for Target Speech Extraction | Mar 11, 2025 | Speech Extraction | Unverified | 0 |
| Enhancing Intelligibility for Generative Target Speech Extraction via Joint Optimization with Target Speaker ASR | Jan 24, 2025 | Speech Extraction | Unverified | 0 |
| Beyond Speaker Identity: Text Guided Target Speech Extraction | Jan 15, 2025 | Speech Extraction, Speech Separation | Code Available | 0 |
| Distance Based Single-Channel Target Speech Extraction | Dec 28, 2024 | Speech Extraction | Unverified | 0 |
| Investigation of Speaker Representation for Target-Speaker Speech Processing | Oct 15, 2024 | Action Detection, Activity Detection | Unverified | 0 |
| Generative Speech Foundation Model Pretraining for High-Quality Speech Extraction and Restoration | Sep 24, 2024 | Bandwidth Extension, Denoising | Code Available | 0 |
| Look Once to Hear: Target Speech Hearing with Noisy Examples | May 10, 2024 | CPU, Speech Extraction | Code Available | 4 |
| Separate in the Speech Chain: Cross-Modal Conditional Audio-Visual Target Speech Extraction | Apr 19, 2024 | Speech Extraction | Unverified | 0 |
| Probing Self-supervised Learning Models with Target Speech Extraction | Feb 17, 2024 | Self-Supervised Learning, Speaker Identification | Unverified | 0 |
| Target Speech Extraction with Pre-trained Self-supervised Learning Models | Feb 17, 2024 | Self-Supervised Learning, Speech Extraction | Unverified | 0 |
| Self-Supervised Disentangled Representation Learning for Robust Target Speech Extraction | Dec 16, 2023 | Disentanglement, Representation Learning | Unverified | 0 |
| Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction | Oct 30, 2023 | Speaker Separation, Speech Enhancement | Unverified | 0 |
| AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data | Sep 25, 2023 | Automatic Speech Recognition, Speech Enhancement | Unverified | 0 |
| DDTSE: Discriminative Diffusion Model for Target Speech Extraction | Sep 25, 2023 | model, Speech Enhancement | Unverified | 0 |
| Target Speech Extraction with Conditional Diffusion Model | Aug 8, 2023 | Denoising, model | Unverified | 0 |
| Enhanced Neural Beamformer with Spatial Information for Target Speech Extraction | Jun 28, 2023 | Dimensionality Reduction, Speech Extraction | Unverified | 0 |
| Audio-Visual Speech Enhancement With Selective Off-Screen Speech Extraction | Jun 10, 2023 | Computational Efficiency, Speech Enhancement | Unverified | 0 |
| Knowledge Distillation for Neural Transducer-based Target-Speaker ASR: Exploiting Parallel Mixture/Single-Talker Speech Data | May 25, 2023 | Knowledge Distillation, Speech Extraction | Unverified | 0 |
| X-SepFormer: End-to-end Speaker Extraction Network with Explicit Optimization on Speaker Confusion | Mar 9, 2023 | Speech Extraction | Unverified | 0 |
| Neural Target Speech Extraction: An Overview | Jan 31, 2023 | Speech Extraction | Code Available | 1 |