| Personalized Keyphrase Detection using Speaker and Environment Information | Apr 28, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | —Unverified | 0 |
| Phasebook and Friends: Leveraging Discrete Representations for Source Separation | Oct 2, 2018 | Speaker Separation, Speech Enhancement | —Unverified | 0 |
| Practical applicability of deep neural networks for overlapping speaker separation | Dec 19, 2019 | Clustering, Deep Clustering | —Unverified | 0 |
| Quantitative Evidence on Overlooked Aspects of Enrollment Speaker Embeddings for Target Speaker Separation | Oct 23, 2022 | Speaker Identification, Speaker Separation | —Unverified | 0 |
| Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction | Oct 30, 2023 | Speaker Separation, Speech Enhancement | —Unverified | 0 |
| SC-SOT: Conditioning the Decoder on Diarized Speaker Information for End-to-End Overlapped Speech Recognition | Jun 15, 2025 | Decoder, speaker-diarization | —Unverified | 0 |
| Seeing Through Noise: Visually Driven Speaker Separation and Enhancement | Aug 22, 2017 | Speaker Separation | —Unverified | 0 |
| VoiceVector: Multimodal Enrolment Vectors for Speaker Separation | Jan 2, 2025 | Speaker Separation | —Unverified | 0 |
| A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings | Mar 31, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | —Unverified | 0 |
| Sequential Multi-Frame Neural Beamforming for Speech Separation and Enhancement | Nov 18, 2019 | Speaker Separation, Speech Enhancement | —Unverified | 0 |