SepMamba: State-space models for speaker separation using Mamba Oct 28, 2024 Mamba Speaker Separation
Code Code Available 15 Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation Feb 22, 2023 Multi-Task Learning Speech Enhancement
Code Code Available 15 Noise-robust Speech Separation with Fast Generative Correction Jun 11, 2024 Speech Separation
Code Code Available 15 SepPrune: Structured Pruning for Efficient Deep Speech Separation May 17, 2025 channel selection Computational Efficiency
Code Code Available 15 VisualVoice: Audio-Visual Speech Separation with Cross-Modal Consistency Jan 8, 2021 Speech Separation
Code Code Available 15 GEV Beamforming Supported by DOA-based Masks Generated on Pairs of Microphones May 19, 2020 speech-recognition Speech Recognition
Code Code Available 15 Attention is All You Need in Speech Separation Oct 25, 2020 All Speech Separation
Code Code Available 15 Enhanced Reverberation as Supervision for Unsupervised Speech Separation Aug 6, 2024 Speech Separation
Code Code Available 15 MESH2IR: Neural Acoustic Impulse Response Generator for Complex 3D Scenes May 18, 2022 2k CPU
Code Code Available 15 Compute and memory efficient universal sound source separation Mar 3, 2021 Audio Source Separation Efficient Neural Network
Code Code Available 15 TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion Jan 25, 2024 speech-recognition Speech Recognition
Code Code Available 15 Continuous speech separation: dataset and analysis Jan 30, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Effective Low-Cost Time-Domain Audio Separation Using Globally Attentive Locally Recurrent Networks Jan 13, 2021 Speech Separation
Code Code Available 15 Continuous Speech Separation with Conformer Aug 13, 2020 Speech Separation
Code Code Available 15 Graph-PIT: Generalized permutation invariant training for continuous separation of arbitrary numbers of speakers Jul 30, 2021 Speech Separation
Code Code Available 15 A cappella: Audio-visual Singing Voice Separation Apr 20, 2021 Music Source Separation Speech Separation
Code Code Available 15 An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits Dec 21, 2022 Speech Separation
Code Code Available 15 End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation Oct 30, 2019 Speech Separation
Code Code Available 15 Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation Oct 14, 2019 Speech Separation
Code Code Available 15 RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual Speech Separation Sep 29, 2023 Audio-Visual Speech Recognition speech-recognition
Code Code Available 15 Text-aware Speech Separation for Multi-talker Keyword Spotting Jun 18, 2024 Keyword Spotting Speech Separation
Code Code Available 15 Online speaker diarization of meetings guided by speech separation Jan 30, 2024 Action Detection Activity Detection
Code Code Available 15 The Cone of Silence: Speech Separation by Localization Oct 12, 2020 Audio Source Separation Speech Separation
Code Code Available 15 Improving speaker discrimination of target speech extraction with time-domain SpeakerBeam Jan 23, 2020 Speaker Identification Speech Extraction
Code Code Available 15 Speech Separation with Pretrained Frontend to Minimize Domain Mismatch Nov 5, 2024 Speech Separation
Code Code Available 05 Complementing Handcrafted Features with Raw Waveform Using a Light-weight Auxiliary Model Sep 6, 2021 speech-recognition Speech Recognition
Code Code Available 05 Analysis of impact of emotions on target speech extraction and speech separation Aug 15, 2022 Speaker Verification Speech Extraction
Code Code Available 05 SPGM: Prioritizing Local Features for enhanced speech separation performance Sep 22, 2023 Speech Separation
Code Code Available 05 Speaker Extraction with Co-Speech Gestures Cue Mar 31, 2022 Speech Separation
Code Code Available 05 CasNet: Investigating Channel Robustness for Speech Separation Oct 27, 2022 Speech Separation
Code Code Available 05 Singing Voice Separation with Deep U-Net Convolutional Networks Oct 27, 2017 Speech Separation Translation
Code Code Available 05 ADL-MVDR: All deep learning MVDR beamformer for target speech separation Aug 16, 2020 All Speech Separation
Code Code Available 05 A Multi-Phase Gammatone Filterbank for Speech Separation via TasNet Oct 25, 2019 Low-latency processing Speech Separation
Code Code Available 05 Divide and Conquer: A Deep CASA Approach to Talker-independent Monaural Speaker Separation Apr 25, 2019 Clustering Speaker Separation
Code Code Available 05 Beyond Speaker Identity: Text Guided Target Speech Extraction Jan 15, 2025 Speech Extraction Speech Separation
Code Code Available 05 Semi-Supervised Monaural Singing Voice Separation With a Masking Network Trained on Synthetic Mixtures Dec 14, 2018 Music Source Separation Speech Separation
Code Code Available 05 CSLNSpeech: solving extended speech separation problem with the help of Chinese sign language Jul 21, 2020 Self-Supervised Learning Speech Separation
Code Code Available 05 Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks Mar 30, 2022 Speech Separation
Code Code Available 05 Permutation Invariant Training of Deep Models for Speaker-Independent Multi-talker Speech Separation Jul 1, 2016 Clustering Deep Clustering
Code Code Available 05 REAL-M: Towards Speech Separation on Real Mixtures Oct 20, 2021 Open-Ended Question Answering Speech Separation
Code Code Available 05 Onssen: an open-source speech separation and enhancement library Nov 3, 2019 Deep Clustering speech-recognition
Code Code Available 05 Exploring Self-Attention Mechanisms for Speech Separation Feb 6, 2022 Denoising Speech Enhancement
Code Code Available 05 Real-time Single-channel Dereverberation and Separation with Time-domainAudio Separation Network Sep 2, 2018 Denoising Speech Dereverberation
Code Code Available 05 Multi-talker Speech Separation with Utterance-level Permutation Invariant Training of Deep Recurrent Neural Networks Mar 18, 2017 Clustering Deep Clustering
Code Code Available 05 Deep Recurrent NMF for Speech Separation by Unfolding Iterative Thresholding Sep 21, 2017 Speech Separation
Code Code Available 05 An enhanced Conv-TasNet model for speech separation using a speaker distance-based loss function May 26, 2022 Speech Separation
Code Code Available 05 Alternative Objective Functions for Deep Clustering Apr 1, 2018 Clustering Deep Clustering
Code Code Available 05 Deep learning for monaural speech separation May 4, 2014 Deep Learning Multi-Speaker Source Separation
Code Code Available 05 Multi-Decoder DPRNN: High Accuracy Source Counting and Separation Nov 24, 2020 Decoder Speech Separation
Code Code Available 05 Resource-Efficient Separation Transformer Jun 19, 2022 Speech Separation
Code Code Available 05