| Generative Speech Foundation Model Pretraining for High-Quality Speech Extraction and Restoration | Sep 24, 2024 | Bandwidth Extension, Denoising | Code Available | 0 |
| Look Once to Hear: Target Speech Hearing with Noisy Examples | May 10, 2024 | CPU, Speech Extraction | Code Available | 4 |
| Separate in the Speech Chain: Cross-Modal Conditional Audio-Visual Target Speech Extraction | Apr 19, 2024 | Speech Extraction | —Unverified | 0 |
| Probing Self-supervised Learning Models with Target Speech Extraction | Feb 17, 2024 | Self-Supervised Learning, Speaker Identification | —Unverified | 0 |
| Target Speech Extraction with Pre-trained Self-supervised Learning Models | Feb 17, 2024 | Self-Supervised Learning, Speech Extraction | —Unverified | 0 |
| Self-Supervised Disentangled Representation Learning for Robust Target Speech Extraction | Dec 16, 2023 | Disentanglement, Representation Learning | —Unverified | 0 |
| Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction | Oct 30, 2023 | Speaker Separation, Speech Enhancement | —Unverified | 0 |
| AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data | Sep 25, 2023 | Automatic Speech Recognition, Speech Enhancement | —Unverified | 0 |
| DDTSE: Discriminative Diffusion Model for Target Speech Extraction | Sep 25, 2023 | model, Speech Enhancement | —Unverified | 0 |
| Target Speech Extraction with Conditional Diffusion Model | Aug 8, 2023 | Denoising, model | —Unverified | 0 |