Short-Term Memory Convolutions Feb 8, 2023 Acoustic Scene Classification Scene Classification
— Unverified 0Should We Always Separate?: Switching Between Enhanced and Observed Signals for Overlapping Speech Recognition Jun 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Simultaneous Denoising and Dereverberation Using Deep Embedding Features Apr 6, 2020 Clustering Deep Clustering
— Unverified 0Simultaneous Speech Extraction for Multiple Target Speakers under the Meeting Scenarios Jun 17, 2022 Action Detection Activity Detection
— Unverified 0Single-Channel Multi-talker Speech Recognition with Permutation Invariant Training Jul 19, 2017 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Single-channel speech separation using Soft-minimum Permutation Invariant Training Nov 16, 2021 Speech Separation
— Unverified 0Single-Channel Speech Separation with Auxiliary Speaker Embeddings Jun 24, 2019 Speech Separation
— Unverified 0Single-Channel Target Speech Extraction Utilizing Distance and Room Clues May 20, 2025 Speech Extraction Speech Separation
— Unverified 0Single-Microphone Speaker Separation and Voice Activity Detection in Noisy and Reverberant Environments Jan 7, 2024 Action Detection Activity Detection
— Unverified 0SkiM: Skipping Memory LSTM for Low-Latency Real-Time Continuous Speech Separation Jan 26, 2022 Speech Separation
— Unverified 0Sound Signal Processing with Seq2Tree Network May 1, 2018 Speech Separation
— Unverified 0Sparsely Overlapped Speech Training in the Time Domain: Joint Learning of Target Speech Separation and Personal VAD Benefits Jun 28, 2021 Speech Separation
— Unverified 0Spatial and spectral deep attention fusion for multi-channel speech separation using deep embedding features Feb 5, 2020 Clustering Deep Attention
— Unverified 0Spatially Selective Deep Non-linear Filters for Speaker Extraction Nov 4, 2022 Speech Separation
— Unverified 0Speakerfilter-Pro: an improved target speaker extractor combines the time domain and frequency domain Oct 25, 2020 Speech Separation
— Unverified 0Speaker-independent Speech Separation with Deep Attractor Network Jul 12, 2017 Deep Learning Speech Separation
— Unverified 0Speech enhancement aided end-to-end multi-task learning for voice activity detection Oct 23, 2020 Action Detection Activity Detection
— Unverified 0Speech Separation based on Contrastive Learning and Deep Modularization May 18, 2023 Contrastive Learning Self-Supervised Learning
— Unverified 0Speech Separation using Neural Audio Codecs with Embedding Loss Nov 27, 2024 Speech Separation
— Unverified 0REAL-M: Towards Speech Separation on Real Mixtures Oct 20, 2021 Open-Ended Question Answering Speech Separation
Code Code Available 0Two-Step Sound Source Separation: Training on Learned Latent Targets Oct 22, 2019 Speech Separation Vocal Bursts Valence Prediction
Code Code Available 0WHAM!: Extending Speech Separation to Noisy Environments Jul 2, 2019 Speech Separation
Code Code Available 0Permutation Invariant Training of Deep Models for Speaker-Independent Multi-talker Speech Separation Jul 1, 2016 Clustering Deep Clustering
Code Code Available 0Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks Mar 30, 2022 Speech Separation
Code Code Available 0Interrupted and cascaded permutation invariant training for speech separation Oct 28, 2019 Speech Separation
Code Code Available 0Deep Recurrent NMF for Speech Separation by Unfolding Iterative Thresholding Sep 21, 2017 Speech Separation
Code Code Available 0Onssen: an open-source speech separation and enhancement library Nov 3, 2019 Deep Clustering speech-recognition
Code Code Available 0Semi-Supervised Monaural Singing Voice Separation With a Masking Network Trained on Synthetic Mixtures Dec 14, 2018 Music Source Separation Speech Separation
Code Code Available 0Deep learning for monaural speech separation May 4, 2014 Deep Learning Multi-Speaker Source Separation
Code Code Available 0Speech Separation with Pretrained Frontend to Minimize Domain Mismatch Nov 5, 2024 Speech Separation
Code Code Available 0Multi-talker Speech Separation with Utterance-level Permutation Invariant Training of Deep Recurrent Neural Networks Mar 18, 2017 Clustering Deep Clustering
Code Code Available 0Improving Voice Separation by Incorporating End-to-end Speech Recognition Nov 29, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0SPGM: Prioritizing Local Features for enhanced speech separation performance Sep 22, 2023 Speech Separation
Code Code Available 0Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering Apr 16, 2019 Clustering Speech Separation
Code Code Available 0An enhanced Conv-TasNet model for speech separation using a speaker distance-based loss function May 26, 2022 Speech Separation
Code Code Available 0Multi-Decoder DPRNN: High Accuracy Source Counting and Separation Nov 24, 2020 Decoder Speech Separation
Code Code Available 0Many-Speakers Single Channel Speech Separation with Optimal Permutation Training Apr 18, 2021 Speech Separation
Code Code Available 0ADL-MVDR: All deep learning MVDR beamformer for target speech separation Aug 16, 2020 All Speech Separation
Code Code Available 0Deep attractor network for single-microphone speaker separation Nov 27, 2016 Speaker Separation Speech Separation
Code Code Available 0WPD++: An Improved Neural Beamformer for Simultaneous Speech Separation and Dereverberation Nov 18, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Complementing Handcrafted Features with Raw Waveform Using a Light-weight Auxiliary Model Sep 6, 2021 speech-recognition Speech Recognition
Code Code Available 0Identify Speakers in Cocktail Parties with End-to-End Attention May 22, 2020 Speaker Identification Speech Separation
Code Code Available 0Singing Voice Separation with Deep U-Net Convolutional Networks Oct 27, 2017 Speech Separation Translation
Code Code Available 0Looking to Listen at the Cocktail Party: A Speaker-Independent Audio-Visual Model for Speech Separation Apr 10, 2018 Speech Separation
Code Code Available 0Unsupervised Deep Clustering for Source Separation: Direct Learning from Mixtures using Spatial Information Nov 5, 2018 Clustering Deep Clustering
Code Code Available 0Analyzing the impact of speaker localization errors on speech separation for automatic speech recognition Oct 24, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Deep Karaoke: Extracting Vocals from Musical Mixtures Using a Convolutional Deep Neural Network Apr 17, 2015 Speech Separation
Code Code Available 0CasNet: Investigating Channel Robustness for Speech Separation Oct 27, 2022 Speech Separation
Code Code Available 0Beyond Speaker Identity: Text Guided Target Speech Extraction Jan 15, 2025 Speech Extraction Speech Separation
Code Code Available 0Filterbank design for end-to-end speech separation Oct 23, 2019 Speaker Recognition Speech Separation
Code Code Available 0