Short-Term Memory Convolutions Feb 8, 2023 Acoustic Scene Classification Scene Classification
— Unverified 0Should We Always Separate?: Switching Between Enhanced and Observed Signals for Overlapping Speech Recognition Jun 2, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Simultaneous Denoising and Dereverberation Using Deep Embedding Features Apr 6, 2020 Clustering Deep Clustering
— Unverified 0Simultaneous Speech Extraction for Multiple Target Speakers under the Meeting Scenarios Jun 17, 2022 Action Detection Activity Detection
— Unverified 0Single-Channel Multi-talker Speech Recognition with Permutation Invariant Training Jul 19, 2017 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Single-channel speech separation using Soft-minimum Permutation Invariant Training Nov 16, 2021 Speech Separation
— Unverified 0Single-Channel Speech Separation with Auxiliary Speaker Embeddings Jun 24, 2019 Speech Separation
— Unverified 0Single-Channel Target Speech Extraction Utilizing Distance and Room Clues May 20, 2025 Speech Extraction Speech Separation
— Unverified 0Single-Microphone Speaker Separation and Voice Activity Detection in Noisy and Reverberant Environments Jan 7, 2024 Action Detection Activity Detection
— Unverified 0SkiM: Skipping Memory LSTM for Low-Latency Real-Time Continuous Speech Separation Jan 26, 2022 Speech Separation
— Unverified 0Sound Signal Processing with Seq2Tree Network May 1, 2018 Speech Separation
— Unverified 0Sparsely Overlapped Speech Training in the Time Domain: Joint Learning of Target Speech Separation and Personal VAD Benefits Jun 28, 2021 Speech Separation
— Unverified 0Spatial and spectral deep attention fusion for multi-channel speech separation using deep embedding features Feb 5, 2020 Clustering Deep Attention
— Unverified 0Permutation Invariant Training of Deep Models for Speaker-Independent Multi-talker Speech Separation Jul 1, 2016 Clustering Deep Clustering
Code Code Available 0Exploring Self-Attention Mechanisms for Speech Separation Feb 6, 2022 Denoising Speech Enhancement
Code Code Available 0Interrupted and cascaded permutation invariant training for speech separation Oct 28, 2019 Speech Separation
Code Code Available 0Unsupervised Deep Clustering for Source Separation: Direct Learning from Mixtures using Spatial Information Nov 5, 2018 Clustering Deep Clustering
Code Code Available 0Analysis of impact of emotions on target speech extraction and speech separation Aug 15, 2022 Speaker Verification Speech Extraction
Code Code Available 0Deep attractor network for single-microphone speaker separation Nov 27, 2016 Speaker Separation Speech Separation
Code Code Available 0Beyond Speaker Identity: Text Guided Target Speech Extraction Jan 15, 2025 Speech Extraction Speech Separation
Code Code Available 0An enhanced Conv-TasNet model for speech separation using a speaker distance-based loss function May 26, 2022 Speech Separation
Code Code Available 0EEND-SS: Joint End-to-End Neural Speaker Diarization and Speech Separation for Flexible Number of Speakers Mar 31, 2022 Decoder speaker-diarization
Code Code Available 0REAL-M: Towards Speech Separation on Real Mixtures Oct 20, 2021 Open-Ended Question Answering Speech Separation
Code Code Available 0Real-time Single-channel Dereverberation and Separation with Time-domainAudio Separation Network Sep 2, 2018 Denoising Speech Dereverberation
Code Code Available 0CSLNSpeech: solving extended speech separation problem with the help of Chinese sign language Jul 21, 2020 Self-Supervised Learning Speech Separation
Code Code Available 0Improving Voice Separation by Incorporating End-to-end Speech Recognition Nov 29, 2019 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Improved Speech Separation with Time-and-Frequency Cross-domain Joint Embedding and Clustering Apr 16, 2019 Clustering Speech Separation
Code Code Available 0A Multi-Phase Gammatone Filterbank for Speech Separation via TasNet Oct 25, 2019 Low-latency processing Speech Separation
Code Code Available 0Divide and Conquer: A Deep CASA Approach to Talker-independent Monaural Speaker Separation Apr 25, 2019 Clustering Speaker Separation
Code Code Available 0Resource-Efficient Separation Transformer Jun 19, 2022 Speech Separation
Code Code Available 0TasNet: time-domain audio separation network for real-time, single-channel speech separation Nov 1, 2017 Decoder Speech Separation
Code Code Available 0Disentangling the Impacts of Language and Channel Variability on Speech Separation Networks Mar 30, 2022 Speech Separation
Code Code Available 0Speaker Extraction with Co-Speech Gestures Cue Mar 31, 2022 Speech Separation
Code Code Available 0Alternative Objective Functions for Deep Clustering Apr 1, 2018 Clustering Deep Clustering
Code Code Available 0Onssen: an open-source speech separation and enhancement library Nov 3, 2019 Deep Clustering speech-recognition
Code Code Available 0Multi-talker Speech Separation with Utterance-level Permutation Invariant Training of Deep Recurrent Neural Networks Mar 18, 2017 Clustering Deep Clustering
Code Code Available 0Two-Step Sound Source Separation: Training on Learned Latent Targets Oct 22, 2019 Speech Separation Vocal Bursts Valence Prediction
Code Code Available 0WHAM!: Extending Speech Separation to Noisy Environments Jul 2, 2019 Speech Separation
Code Code Available 0Multi-Decoder DPRNN: High Accuracy Source Counting and Separation Nov 24, 2020 Decoder Speech Separation
Code Code Available 0Deep Recurrent NMF for Speech Separation by Unfolding Iterative Thresholding Sep 21, 2017 Speech Separation
Code Code Available 0Identify Speakers in Cocktail Parties with End-to-End Attention May 22, 2020 Speaker Identification Speech Separation
Code Code Available 0Deep learning for monaural speech separation May 4, 2014 Deep Learning Multi-Speaker Source Separation
Code Code Available 0Many-Speakers Single Channel Speech Separation with Optimal Permutation Training Apr 18, 2021 Speech Separation
Code Code Available 0Semi-Supervised Monaural Singing Voice Separation With a Masking Network Trained on Synthetic Mixtures Dec 14, 2018 Music Source Separation Speech Separation
Code Code Available 0Deep Karaoke: Extracting Vocals from Musical Mixtures Using a Convolutional Deep Neural Network Apr 17, 2015 Speech Separation
Code Code Available 0Speech Separation with Pretrained Frontend to Minimize Domain Mismatch Nov 5, 2024 Speech Separation
Code Code Available 0Looking to Listen at the Cocktail Party: A Speaker-Independent Audio-Visual Model for Speech Separation Apr 10, 2018 Speech Separation
Code Code Available 0Filterbank design for end-to-end speech separation Oct 23, 2019 Speaker Recognition Speech Separation
Code Code Available 0SPGM: Prioritizing Local Features for enhanced speech separation performance Sep 22, 2023 Speech Separation
Code Code Available 0Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments Nov 6, 2018 Speech Enhancement Speech Separation
Code Code Available 0