SPMamba: State-space model is all you need in speech separation Apr 2, 2024 All Mamba
Code Code Available 3SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios Oct 2, 2024 Speech Enhancement Speech Separation
Code Code Available 3Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Sep 20, 2018 Multi-task Audio Source Seperation Music Source Separation
Code Code Available 3SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline May 25, 2025 Speech Extraction Speech Separation
Code Code Available 3Separate and Reconstruct: Asymmetric Encoder-Decoder for Speech Separation Jun 10, 2024 Chunking Speech Separation
Code Code Available 3An efficient encoder-decoder architecture with top-down attention for speech separation Sep 30, 2022 CPU
Code Code Available 2CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement Sep 22, 2022 Audio Super-Resolution Automatic Speech Recognition
Code Code Available 2Target Speaker ASR with Whisper Sep 14, 2024 Speech Separation
Code Code Available 2PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings Mar 4, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 2Dual-path Mamba: Short and Long-term Bidirectional Selective Structured State Space Models for Speech Separation Mar 27, 2024 Mamba Speech Separation
Code Code Available 2Speech Slytherin: Examining the Performance and Efficiency of Mamba for Speech Separation, Recognition, and Synthesis Jul 13, 2024 Mamba speech-recognition
Code Code Available 2TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement Aug 6, 2024 Speech Enhancement Speech Separation
Code Code Available 2Voice Separation with an Unknown Number of Multiple Speakers Feb 29, 2020 Speech Separation
Code Code Available 2IIANet: An Intra- and Inter-Modality Attention Network for Audio-Visual Speech Separation Aug 16, 2023 Speech Separation
Code Code Available 1A Time-domain Real-valued Generalized Wiener Filter for Multi-channel Neural Separation Systems Dec 7, 2021 Speech Separation
Code Code Available 1Stabilizing Label Assignment for Speech Separation by Self-supervised Pre-training Oct 29, 2020 Speaker Separation Speech Enhancement
Code Code Available 1On Time Domain Conformer Models for Monaural Speech Separation in Noisy Reverberant Acoustic Environments Oct 9, 2023 Computational Efficiency Speech Separation
Code Code Available 1Annealed Multiple Choice Learning: Overcoming limitations of Winner-takes-all with annealing Jul 22, 2024 All Diversity
Code Code Available 1Noise-robust Speech Separation with Fast Generative Correction Jun 11, 2024 Speech Separation
Code Code Available 1Papez: Resource-Efficient Speech Separation with Auditory Working Memory Jul 1, 2024 Speech Separation
Code Code Available 1MESH2IR: Neural Acoustic Impulse Response Generator for Complex 3D Scenes May 18, 2022 2k CPU
Code Code Available 1Low-Latency Speech Separation Guided Diarization for Telephone Conversations Apr 5, 2022 Action Detection Activity Detection
Code Code Available 1Blind Speech Separation and Dereverberation using Neural Beamforming Mar 24, 2021 Speaker Identification Speaker Separation
Code Code Available 1Multi-Task Audio Source Separation Jul 14, 2021 Audio Source Separation Multi-task Audio Source Seperation
Code Code Available 1OCD: Learning to Overfit with Conditional Diffusion Models Oct 2, 2022 3D Reconstruction Denoising
Code Code Available 1Online speaker diarization of meetings guided by speech separation Jan 30, 2024 Action Detection Activity Detection
Code Code Available 1MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-Head Transformer with Convolution-Augmented Joint Self-Attentions Feb 23, 2023 Speech Separation
Code Code Available 1Attention is All You Need in Speech Separation Oct 25, 2020 All Speech Separation
Code Code Available 1RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual Speech Separation Sep 29, 2023 Audio-Visual Speech Recognition speech-recognition
Code Code Available 1Sandglasset: A Light Multi-Granularity Self-attentive Network For Time-Domain Speech Separation Mar 1, 2021 Computational Efficiency Speech Separation
Code Code Available 1Group Communication with Context Codec for Lightweight Source Separation Dec 14, 2020 Decoder Speech Enhancement
Code Code Available 1Enhanced Reverberation as Supervision for Unsupervised Speech Separation Aug 6, 2024 Speech Separation
Code Code Available 1Improving speaker discrimination of target speech extraction with time-domain SpeakerBeam Jan 23, 2020 Speaker Identification Speech Extraction
Code Code Available 1Independent Vector Analysis with Deep Neural Network Source Priors Aug 23, 2020 Speech Separation
Code Code Available 1DPCCN: Densely-Connected Pyramid Complex Convolutional Network for Robust Speech Separation And Extraction Dec 27, 2021 Speech Extraction Speech Separation
Code Code Available 1Don't shoot butterfly with rifles: Multi-channel Continuous Speech Separation with Early Exit Transformer Oct 23, 2020 Speech Separation
Code Code Available 1Directional Sparse Filtering using Weighted Lehmer Mean for Blind Separation of Unbalanced Speech Mixtures Jan 30, 2021 Audio Source Separation blind source separation
Code Code Available 1Distributed speech separation in spatially unconstrained microphone arrays Nov 2, 2020 Diversity Speech Separation
Code Code Available 1Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Model May 31, 2023 Speech Separation
Code Code Available 1End-to-end Microphone Permutation and Number Invariant Multi-channel Speech Separation Oct 30, 2019 Speech Separation
Code Code Available 1GEV Beamforming Supported by DOA-based Masks Generated on Pairs of Microphones May 19, 2020 speech-recognition Speech Recognition
Code Code Available 1Graph-PIT: Generalized permutation invariant training for continuous separation of arbitrary numbers of speakers Jul 30, 2021 Speech Separation
Code Code Available 1A Neural State-Space Model Approach to Efficient Speech Separation May 26, 2023 Representation Learning Speech Separation
Code Code Available 1AV-CrossNet: an Audiovisual Complex Spectral Mapping Network for Speech Separation By Leveraging Narrow- and Cross-Band Modeling Jun 17, 2024 Speaker Separation Speech Enhancement
Code Code Available 1Beam-Guided TasNet: An Iterative Speech Separation Framework with Multi-Channel Output Feb 5, 2021 blind source separation Speech Separation
Code Code Available 1LiMuSE: Lightweight Multi-modal Speaker Extraction Nov 7, 2021 Model Compression Quantization
Code Code Available 1Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation Oct 14, 2019 Speech Separation
Code Code Available 1Continuous Speech Separation with Conformer Aug 13, 2020 Speech Separation
Code Code Available 1An Overview of Deep-Learning-Based Audio-Visual Speech Enhancement and Separation Aug 21, 2020 Deep Learning Speech Enhancement
Code Code Available 1An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits Dec 21, 2022 Speech Separation
Code Code Available 1