| Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation | Sep 20, 2018 | Multi-task Audio Source SeperationMusic Source Separation | CodeCode Available | 3 | 5 |
| VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking | Oct 11, 2018 | Speaker RecognitionSpeaker Separation | CodeCode Available | 2 | 5 |
| SepMamba: State-space models for speaker separation using Mamba | Oct 28, 2024 | MambaSpeaker Separation | CodeCode Available | 1 | 5 |
| Single-Channel Multi-Speaker Separation using Deep Clustering | Jul 7, 2016 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 | 5 |
| Multi-microphone Complex Spectral Mapping for Utterance-wise and Continuous Speech Separation | Oct 4, 2020 | Speaker SeparationSpeech Separation | CodeCode Available | 1 | 5 |
| Blind Speech Separation and Dereverberation using Neural Beamforming | Mar 24, 2021 | Speaker IdentificationSpeaker Separation | CodeCode Available | 1 | 5 |
| AV-CrossNet: an Audiovisual Complex Spectral Mapping Network for Speech Separation By Leveraging Narrow- and Cross-Band Modeling | Jun 17, 2024 | Speaker SeparationSpeech Enhancement | CodeCode Available | 1 | 5 |
| Stabilizing Label Assignment for Speech Separation by Self-supervised Pre-training | Oct 29, 2020 | Speaker SeparationSpeech Enhancement | CodeCode Available | 1 | 5 |
| Speech Separation Based on Multi-Stage Elaborated Dual-Path Deep BiLSTM with Auxiliary Identity Loss | Aug 6, 2020 | Speaker SeparationSpeech Separation | CodeCode Available | 1 | 5 |
| Deep attractor network for single-microphone speaker separation | Nov 27, 2016 | Speaker SeparationSpeech Separation | CodeCode Available | 0 | 5 |
| Monaural Audio Speaker Separation with Source Contrastive Estimation | May 12, 2017 | ClusteringDescriptive | CodeCode Available | 0 | 5 |
| Divide and Conquer: A Deep CASA Approach to Talker-independent Monaural Speaker Separation | Apr 25, 2019 | ClusteringSpeaker Separation | CodeCode Available | 0 | 5 |
| Neural separation of observed and unobserved distributions | Nov 30, 2018 | Speaker Separation | CodeCode Available | 0 | 5 |
| High Fidelity Speech Regeneration with Application to Speech Enhancement | Jan 31, 2021 | DenoisingSpeaker Separation | —Unverified | 0 | 0 |
| Independent Vector Extraction Constrained on Manifold of Half-Length Filters | Apr 4, 2023 | Speaker Separation | —Unverified | 0 | 0 |
| Individualized Conditioning and Negative Distances for Speaker Separation | Oct 12, 2022 | Speaker SeparationTriplet | —Unverified | 0 | 0 |
| Interactive Speech and Noise Modeling for Speech Enhancement | Dec 17, 2020 | DiversitySpeaker Separation | —Unverified | 0 | 0 |
| Learning-based Robust Speaker Counting and Separation with the Aid of Spatial Coherence | Mar 13, 2023 | Speaker SeparationSpeech Separation | —Unverified | 0 | 0 |
| Location-based training for multi-channel talker-independent speaker separation | Oct 8, 2021 | Speaker Separation | —Unverified | 0 | 0 |
| Mixture to Mixture: Leveraging Close-talk Mixtures as Weak-supervision for Speech Separation | Feb 14, 2024 | Speaker SeparationSpeech Separation | —Unverified | 0 | 0 |
| Multi-channel Conversational Speaker Separation via Neural Diarization | Nov 15, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Multi-Channel Target Speaker Extraction with Refinement: The WavLab Submission to the Second Clarity Enhancement Challenge | Feb 15, 2023 | Speaker SeparationSpeech Enhancement | —Unverified | 0 | 0 |
| Multi-Microphone Speaker Separation by Spatial Regions | Mar 13, 2023 | Speaker Separation | —Unverified | 0 | 0 |
| Multiple Speaker Separation from Noisy Sources in Reverberant Rooms using Relative Transfer Matrix | Mar 12, 2025 | Speaker Separation | —Unverified | 0 | 0 |
| Multi-resolution location-based training for multi-channel continuous speech separation | Jan 16, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| EEG-based Auditory Attention Decoding: Towards Neuro-Steered Hearing Devices | Aug 11, 2020 | EEGElectroencephalogram (EEG) | —Unverified | 0 | 0 |
| New Insights on Target Speaker Extraction | Feb 1, 2022 | Speaker SeparationTarget Speaker Extraction | —Unverified | 0 | 0 |
| Online Binaural Speech Separation of Moving Speakers With a Wavesplit Network | Mar 13, 2023 | Online ClusteringSpeaker Separation | —Unverified | 0 | 0 |
| Online Self-Attentive Gated RNNs for Real-Time Speaker Separation | Jun 25, 2021 | blind source separationSpeaker Separation | —Unverified | 0 | 0 |
| On permutation invariant training for speech source separation | Feb 9, 2021 | ClusteringSpeaker Separation | —Unverified | 0 | 0 |
| Personalized Keyphrase Detection using Speaker and Environment Information | Apr 28, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Phasebook and Friends: Leveraging Discrete Representations for Source Separation | Oct 2, 2018 | Speaker SeparationSpeech Enhancement | —Unverified | 0 | 0 |
| Practical applicability of deep neural networks for overlapping speaker separation | Dec 19, 2019 | ClusteringDeep Clustering | —Unverified | 0 | 0 |
| Quantitative Evidence on Overlooked Aspects of Enrollment Speaker Embeddings for Target Speaker Separation | Oct 23, 2022 | Speaker IdentificationSpeaker Separation | —Unverified | 0 | 0 |
| Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction | Oct 30, 2023 | Speaker SeparationSpeech Enhancement | —Unverified | 0 | 0 |
| SC-SOT: Conditioning the Decoder on Diarized Speaker Information for End-to-End Overlapped Speech Recognition | Jun 15, 2025 | Decoderspeaker-diarization | —Unverified | 0 | 0 |
| Seeing Through Noise: Visually Driven Speaker Separation and Enhancement | Aug 22, 2017 | Speaker Separation | —Unverified | 0 | 0 |
| Single-Microphone Speaker Separation and Voice Activity Detection in Noisy and Reverberant Environments | Jan 7, 2024 | Action DetectionActivity Detection | —Unverified | 0 | 0 |
| Spatial-Temporal Activity-Informed Diarization and Separation | Jan 30, 2024 | speaker-diarizationSpeaker Diarization | —Unverified | 0 | 0 |
| Speaker Separation Using Speaker Inventories and Estimated Speech | Oct 20, 2020 | Speaker SeparationSpeech Extraction | —Unverified | 0 | 0 |
| SuperM2M: Supervised and Mixture-to-Mixture Co-Learning for Speech Enhancement and Noise-Robust ASR | Mar 15, 2024 | Speaker SeparationSpeech Enhancement | —Unverified | 0 | 0 |
| Supervised Speech Separation Based on Deep Learning: An Overview | Aug 24, 2017 | Deep LearningSpeaker Separation | —Unverified | 0 | 0 |
| Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches | Apr 4, 2022 | blind source separationMetric Learning | —Unverified | 0 | 0 |
| UNSSOR: Unsupervised Neural Speech Separation by Leveraging Over-determined Training Mixtures | May 31, 2023 | Speaker SeparationSpeech Separation | —Unverified | 0 | 0 |
| A Comparative Study on Multichannel Speaker-Attributed Automatic Speech Recognition in Multi-party Meetings | Nov 1, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| VoiceVector: Multimodal Enrolment Vectors for Speaker Separation | Jan 2, 2025 | Speaker Separation | —Unverified | 0 | 0 |
| A Comparative Study on Speaker-attributed Automatic Speech Recognition in Multi-party Meetings | Mar 31, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Sequential Multi-Frame Neural Beamforming for Speech Separation and Enhancement | Nov 18, 2019 | Speaker SeparationSpeech Enhancement | —Unverified | 0 | 0 |
| Auditory Separation of a Conversation from Background via Attentional Gating | May 26, 2019 | Speaker Separation | —Unverified | 0 | 0 |
| Auxiliary Interference Speaker Loss for Target-Speaker Speech Recognition | Jun 26, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |