SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline | May 25, 2025 | Speech Extraction, Speech Separation | Code Available 3
SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios | Oct 2, 2024 | Speech Enhancement, Speech Separation | Code Available 3
Separate and Reconstruct: Asymmetric Encoder-Decoder for Speech Separation | Jun 10, 2024 | Chunking, Speech Separation | Code Available 3
SPMamba: State-space model is all you need in speech separation | Apr 2, 2024 | All, Mamba | Code Available 3
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation | Sep 20, 2018 | Multi-task Audio Source Separation, Music Source Separation | Code Available 3
Target Speaker ASR with Whisper | Sep 14, 2024 | Speech Separation | Code Available 2
TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement | Aug 6, 2024 | Speech Enhancement, Speech Separation | Code Available 2
Speech Slytherin: Examining the Performance and Efficiency of Mamba for Speech Separation, Recognition, and Synthesis | Jul 13, 2024 | Mamba, speech-recognition | Code Available 2
Dual-path Mamba: Short and Long-term Bidirectional Selective Structured State Space Models for Speech Separation | Mar 27, 2024 | Mamba, Speech Separation | Code Available 2
PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings | Mar 4, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 2
An efficient encoder-decoder architecture with top-down attention for speech separation | Sep 30, 2022 | CPU | Code Available 2
CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement | Sep 22, 2022 | Audio Super-Resolution, Automatic Speech Recognition | Code Available 2
Voice Separation with an Unknown Number of Multiple Speakers | Feb 29, 2020 | Speech Separation | Code Available 2
SepPrune: Structured Pruning for Efficient Deep Speech Separation | May 17, 2025 | channel selection, Computational Efficiency | Code Available 1
ArrayDPS: Unsupervised Blind Speech Separation with a Diffusion Prior | May 8, 2025 | Room Impulse Response (RIR), Speech Separation | Code Available 1
VANPY: Voice Analysis Framework | Feb 17, 2025 | Action Detection, Activity Detection | Code Available 1
SepMamba: State-space models for speaker separation using Mamba | Oct 28, 2024 | Mamba, Speaker Separation | Code Available 1
USEF-TSE: Universal Speaker Embedding Free Target Speaker Extraction | Sep 4, 2024 | Speaker Recognition, Speech Separation | Code Available 1
Enhanced Reverberation as Supervision for Unsupervised Speech Separation | Aug 6, 2024 | Speech Separation | Code Available 1
Annealed Multiple Choice Learning: Overcoming limitations of Winner-takes-all with annealing | Jul 22, 2024 | All, Diversity | Code Available 1
Papez: Resource-Efficient Speech Separation with Auditory Working Memory | Jul 1, 2024 | Speech Separation | Code Available 1
Towards Audio Codec-based Speech Separation | Jun 18, 2024 | Edge-computing, Speech Separation | Code Available 1
Text-aware Speech Separation for Multi-talker Keyword Spotting | Jun 18, 2024 | Keyword Spotting, Speech Separation | Code Available 1
AV-CrossNet: an Audiovisual Complex Spectral Mapping Network for Speech Separation By Leveraging Narrow- and Cross-Band Modeling | Jun 17, 2024 | Speaker Separation, Speech Enhancement | Code Available 1
Noise-robust Speech Separation with Fast Generative Correction | Jun 11, 2024 | Speech Separation | Code Available 1
Online speaker diarization of meetings guided by speech separation | Jan 30, 2024 | Action Detection, Activity Detection | Code Available 1
TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion | Jan 25, 2024 | speech-recognition, Speech Recognition | Code Available 1
On Time Domain Conformer Models for Monaural Speech Separation in Noisy Reverberant Acoustic Environments | Oct 9, 2023 | Computational Efficiency, Speech Separation | Code Available 1
RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual Speech Separation | Sep 29, 2023 | Audio-Visual Speech Recognition, speech-recognition | Code Available 1
IIANet: An Intra- and Inter-Modality Attention Network for Audio-Visual Speech Separation | Aug 16, 2023 | Speech Separation | Code Available 1
Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Model | May 31, 2023 | Speech Separation | Code Available 1
A Neural State-Space Model Approach to Efficient Speech Separation | May 26, 2023 | Representation Learning, Speech Separation | Code Available 1
MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-Head Transformer with Convolution-Augmented Joint Self-Attentions | Feb 23, 2023 | Speech Separation | Code Available 1
Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation | Feb 22, 2023 | Multi-Task Learning, Speech Enhancement | Code Available 1
An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits | Dec 21, 2022 | Speech Separation | Code Available 1
Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech Separation | Oct 27, 2022 | Speech Dereverberation, Speech Separation | Code Available 1
OCD: Learning to Overfit with Conditional Diffusion Models | Oct 2, 2022 | 3D Reconstruction, Denoising | Code Available 1
MESH2IR: Neural Acoustic Impulse Response Generator for Complex 3D Scenes | May 18, 2022 | 2k, CPU | Code Available 1
Low-Latency Speech Separation Guided Diarization for Telephone Conversations | Apr 5, 2022 | Action Detection, Activity Detection | Code Available 1
VoViT: Low Latency Graph-based Audio-Visual Voice Separation Transformer | Mar 8, 2022 | Speech Separation | Code Available 1
MixCycle: Unsupervised Speech Separation via Cyclic Mixture Permutation Invariant Training | Feb 8, 2022 | Data Augmentation, Speech Separation | Code Available 1
DPCCN: Densely-Connected Pyramid Complex Convolutional Network for Robust Speech Separation And Extraction | Dec 27, 2021 | Speech Extraction, Speech Separation | Code Available 1
A Time-domain Real-valued Generalized Wiener Filter for Multi-channel Neural Separation Systems | Dec 7, 2021 | Speech Separation | Code Available 1
LiMuSE: Lightweight Multi-modal Speaker Extraction | Nov 7, 2021 | Model Compression, Quantization | Code Available 1
Graph-PIT: Generalized permutation invariant training for continuous separation of arbitrary numbers of speakers | Jul 30, 2021 | Speech Separation | Code Available 1
Multi-Task Audio Source Separation | Jul 14, 2021 | Audio Source Separation, Multi-task Audio Source Separation | Code Available 1
A cappella: Audio-visual Singing Voice Separation | Apr 20, 2021 | Music Source Separation, Speech Separation | Code Available 1
Blind Speech Separation and Dereverberation using Neural Beamforming | Mar 24, 2021 | Speaker Identification, Speaker Separation | Code Available 1
Compute and memory efficient universal sound source separation | Mar 3, 2021 | Audio Source Separation, Efficient Neural Network | Code Available 1
Sandglasset: A Light Multi-Granularity Self-attentive Network For Time-Domain Speech Separation | Mar 1, 2021 | Computational Efficiency, Speech Separation | Code Available 1