| Personalized Keyphrase Detection using Speaker and Environment Information | Apr 28, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Phasebook and Friends: Leveraging Discrete Representations for Source Separation | Oct 2, 2018 | Speaker SeparationSpeech Enhancement | —Unverified | 0 | 0 |
| Practical applicability of deep neural networks for overlapping speaker separation | Dec 19, 2019 | ClusteringDeep Clustering | —Unverified | 0 | 0 |
| Quantitative Evidence on Overlooked Aspects of Enrollment Speaker Embeddings for Target Speaker Separation | Oct 23, 2022 | Speaker IdentificationSpeaker Separation | —Unverified | 0 | 0 |
| Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction | Oct 30, 2023 | Speaker SeparationSpeech Enhancement | —Unverified | 0 | 0 |
| SC-SOT: Conditioning the Decoder on Diarized Speaker Information for End-to-End Overlapped Speech Recognition | Jun 15, 2025 | Decoderspeaker-diarization | —Unverified | 0 | 0 |
| Seeing Through Noise: Visually Driven Speaker Separation and Enhancement | Aug 22, 2017 | Speaker Separation | —Unverified | 0 | 0 |
| Single-Microphone Speaker Separation and Voice Activity Detection in Noisy and Reverberant Environments | Jan 7, 2024 | Action DetectionActivity Detection | —Unverified | 0 | 0 |
| Spatial-Temporal Activity-Informed Diarization and Separation | Jan 30, 2024 | speaker-diarizationSpeaker Diarization | —Unverified | 0 | 0 |
| Speaker Separation Using Speaker Inventories and Estimated Speech | Oct 20, 2020 | Speaker SeparationSpeech Extraction | —Unverified | 0 | 0 |