Separate and Reconstruct: Asymmetric Encoder-Decoder for Speech Separation Jun 10, 2024 Chunking Speech Separation
Code Code Available 35 Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Sep 20, 2018 Multi-task Audio Source Seperation Music Source Separation
Code Code Available 35 SPMamba: State-space model is all you need in speech separation Apr 2, 2024 All Mamba
Code Code Available 35 SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline May 25, 2025 Speech Extraction Speech Separation
Code Code Available 35 SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios Oct 2, 2024 Speech Enhancement Speech Separation
Code Code Available 35 An efficient encoder-decoder architecture with top-down attention for speech separation Sep 30, 2022 CPU
Code Code Available 25 Dual-path Mamba: Short and Long-term Bidirectional Selective Structured State Space Models for Speech Separation Mar 27, 2024 Mamba Speech Separation
Code Code Available 25 CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement Sep 22, 2022 Audio Super-Resolution Automatic Speech Recognition
Code Code Available 25 Voice Separation with an Unknown Number of Multiple Speakers Feb 29, 2020 Speech Separation
Code Code Available 25 PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings Mar 4, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 25 Target Speaker ASR with Whisper Sep 14, 2024 Speech Separation
Code Code Available 25 Speech Slytherin: Examining the Performance and Efficiency of Mamba for Speech Separation, Recognition, and Synthesis Jul 13, 2024 Mamba speech-recognition
Code Code Available 25 TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement Aug 6, 2024 Speech Enhancement Speech Separation
Code Code Available 25 IIANet: An Intra- and Inter-Modality Attention Network for Audio-Visual Speech Separation Aug 16, 2023 Speech Separation
Code Code Available 15 Graph-PIT: Generalized permutation invariant training for continuous separation of arbitrary numbers of speakers Jul 30, 2021 Speech Separation
Code Code Available 15 Stabilizing Label Assignment for Speech Separation by Self-supervised Pre-training Oct 29, 2020 Speaker Separation Speech Enhancement
Code Code Available 15 Enhanced Reverberation as Supervision for Unsupervised Speech Separation Aug 6, 2024 Speech Separation
Code Code Available 15 Effective Low-Cost Time-Domain Audio Separation Using Globally Attentive Locally Recurrent Networks Jan 13, 2021 Speech Separation
Code Code Available 15 Papez: Resource-Efficient Speech Separation with Auditory Working Memory Jul 1, 2024 Speech Separation
Code Code Available 15 RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual Speech Separation Sep 29, 2023 Audio-Visual Speech Recognition speech-recognition
Code Code Available 15 DPCCN: Densely-Connected Pyramid Complex Convolutional Network for Robust Speech Separation And Extraction Dec 27, 2021 Speech Extraction Speech Separation
Code Code Available 15 A Neural State-Space Model Approach to Efficient Speech Separation May 26, 2023 Representation Learning Speech Separation
Code Code Available 15 Annealed Multiple Choice Learning: Overcoming limitations of Winner-takes-all with annealing Jul 22, 2024 All Diversity
Code Code Available 15 Group Communication with Context Codec for Lightweight Source Separation Dec 14, 2020 Decoder Speech Enhancement
Code Code Available 15 OCD: Learning to Overfit with Conditional Diffusion Models Oct 2, 2022 3D Reconstruction Denoising
Code Code Available 15 End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation Oct 30, 2019 Speech Separation
Code Code Available 15 A Time-domain Real-valued Generalized Wiener Filter for Multi-channel Neural Separation Systems Dec 7, 2021 Speech Separation
Code Code Available 15 Attention is All You Need in Speech Separation Oct 25, 2020 All Speech Separation
Code Code Available 15 GEV Beamforming Supported by DOA-based Masks Generated on Pairs of Microphones May 19, 2020 speech-recognition Speech Recognition
Code Code Available 15 Sandglasset: A Light Multi-Granularity Self-attentive Network For Time-Domain Speech Separation Mar 1, 2021 Computational Efficiency Speech Separation
Code Code Available 15 Multi-Task Audio Source Separation Jul 14, 2021 Audio Source Separation Multi-task Audio Source Seperation
Code Code Available 15 Distributed speech separation in spatially unconstrained microphone arrays Nov 2, 2020 Diversity Speech Separation
Code Code Available 15 Noise-robust Speech Separation with Fast Generative Correction Jun 11, 2024 Speech Separation
Code Code Available 15 Online speaker diarization of meetings guided by speech separation Jan 30, 2024 Action Detection Activity Detection
Code Code Available 15 Deep clustering: Discriminative embeddings for segmentation and separation Aug 18, 2015 Clustering Deep Clustering
Code Code Available 15 MESH2IR: Neural Acoustic Impulse Response Generator for Complex 3D Scenes May 18, 2022 2k CPU
Code Code Available 15 MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-Head Transformer with Convolution-Augmented Joint Self-Attentions Feb 23, 2023 Speech Separation
Code Code Available 15 Continuous speech separation: dataset and analysis Jan 30, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Model May 31, 2023 Speech Separation
Code Code Available 15 Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech Separation Oct 27, 2022 Speech Dereverberation Speech Separation
Code Code Available 15 Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation Oct 14, 2019 Speech Separation
Code Code Available 15 Directional Sparse Filtering using Weighted Lehmer Mean for Blind Separation of Unbalanced Speech Mixtures Jan 30, 2021 Audio Source Separation blind source separation
Code Code Available 15 Continuous Speech Separation with Conformer Aug 13, 2020 Speech Separation
Code Code Available 15 AV-CrossNet: an Audiovisual Complex Spectral Mapping Network for Speech Separation By Leveraging Narrow- and Cross-Band Modeling Jun 17, 2024 Speaker Separation Speech Enhancement
Code Code Available 15 Beam-Guided TasNet: An Iterative Speech Separation Framework with Multi-Channel Output Feb 5, 2021 blind source separation Speech Separation
Code Code Available 15 Don't shoot butterfly with rifles: Multi-channel Continuous Speech Separation with Early Exit Transformer Oct 23, 2020 Speech Separation
Code Code Available 15 Independent Vector Analysis with Deep Neural Network Source Priors Aug 23, 2020 Speech Separation
Code Code Available 15 Blind Speech Separation and Dereverberation using Neural Beamforming Mar 24, 2021 Speaker Identification Speaker Separation
Code Code Available 15 An Overview of Deep-Learning-Based Audio-Visual Speech Enhancement and Separation Aug 21, 2020 Deep Learning Speech Enhancement
Code Code Available 15 An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits Dec 21, 2022 Speech Separation
Code Code Available 15