End-to-End Integration of Speech Separation and Voice Activity Detection for Low-Latency Diarization of Telephone Conversations | Mar 21, 2023 | Action Detection, Activity Detection | Unverified 0
Towards Real-Time Single-Channel Speech Separation in Noisy and Reverberant Environments | Mar 14, 2023 | Decoder, Speech Separation | Unverified 0
Online Binaural Speech Separation of Moving Speakers With a Wavesplit Network | Mar 13, 2023 | Online Clustering, Speaker Separation | Unverified 0
Learning-based Robust Speaker Counting and Separation with the Aid of Spatial Coherence | Mar 13, 2023 | Speaker Separation, Speech Separation | Unverified 0
A Multi-Stage Triple-Path Method for Speech Separation in Noisy and Reverberant Environments | Mar 7, 2023 | Denoising, Speech Denoising | Unverified 0
Multi-Dimensional and Multi-Scale Modeling for Speech Separation Optimized by Discriminative Learning | Mar 7, 2023 | Speech Separation | Unverified 0
Scaling strategies for on-device low-complexity source separation with Conv-Tasnet | Mar 6, 2023 | Speech Separation | Unverified 0
MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-Head Transformer with Convolution-Augmented Joint Self-Attentions | Feb 23, 2023 | Speech Separation | Code Available 1
Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation | Feb 22, 2023 | Multi-Task Learning, Speech Enhancement | Code Available 1
Deep AHS: A Deep Learning Approach to Acoustic Howling Suppression | Feb 18, 2023 | Deep Learning, Speech Separation | Unverified 0
Short-Term Memory Convolutions | Feb 8, 2023 | Acoustic Scene Classification, Scene Classification | Unverified 0
Separate And Diffuse: Using a Pretrained Diffusion Model for Improving Source Separation | Jan 25, 2023 | Audio Source Separation, Generalization Bounds | Unverified 0
Improving Target Speaker Extraction with Sparse LDA-transformed Speaker Embeddings | Jan 16, 2023 | Speaker Verification, Speech Separation | Unverified 0
Multi-resolution location-based training for multi-channel continuous speech separation | Jan 16, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
An Audio-Visual Speech Separation Model Inspired by Cortico-Thalamo-Cortical Circuits | Dec 21, 2022 | Speech Separation | Code Available 1
MIMO-DBnet: Multi-channel Input and Multiple Outputs DOA-aware Beamforming Network for Speech Separation | Dec 7, 2022 | Speech Separation | Unverified 0
Deep neural network techniques for monaural speech enhancement: state of the art analysis | Dec 1, 2022 | Art Analysis, Image Generation | Unverified 0
Deep Neural Mel-Subband Beamformer for In-car Speech Separation | Nov 22, 2022 | Speech Separation | Unverified 0
Self-Remixing: Unsupervised Speech Separation via Separation and Remixing | Nov 18, 2022 | Domain Adaptation, Semi-supervised Domain Adaptation | Unverified 0
Reverberation as Supervision for Speech Separation | Nov 15, 2022 | Speech Separation | Unverified 0
Handling Trade-Offs in Speech Separation with Sparsely-Gated Mixture of Experts | Nov 11, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
An Adapter based Multi-label Pre-training for Speech Separation and Enhancement | Nov 11, 2022 | Denoising, Pseudo Label | Unverified 0
Speech separation with large-scale self-supervised learning | Nov 9, 2022 | Self-Supervised Learning, Speech Separation | Unverified 0
Spatially Selective Deep Non-linear Filters for Speaker Extraction | Nov 4, 2022 | Speech Separation | Unverified 0
SCA: Streaming Cross-attention Alignment for Echo Cancellation | Nov 1, 2022 | Speech Enhancement, Speech Separation | Unverified 0
Audio-Visual Speech Enhancement and Separation by Utilizing Multi-Modal Self-Supervised Embeddings | Oct 31, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
CasNet: Investigating Channel Robustness for Speech Separation | Oct 27, 2022 | Speech Separation | Code Available 0
Deformable Temporal Convolutional Networks for Monaural Noisy Reverberant Speech Separation | Oct 27, 2022 | Speech Dereverberation, Speech Separation | Code Available 1
Provable Subspace Identification Under Post-Nonlinear Mixtures | Oct 14, 2022 | Causal Discovery, Speech Separation | Unverified 0
OCD: Learning to Overfit with Conditional Diffusion Models | Oct 2, 2022 | 3D Reconstruction, Denoising | Code Available 1
An efficient encoder-decoder architecture with top-down attention for speech separation | Sep 30, 2022 | CPU | Code Available 2
CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement | Sep 22, 2022 | Audio Super-Resolution, Automatic Speech Recognition | Code Available 2
VarArray Meets t-SOT: Advancing the State of the Art of Streaming Distant Conversational Speech Recognition | Sep 12, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
Streaming Target-Speaker ASR with Neural Transducer | Sep 9, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified 0
Analysis of impact of emotions on target speech extraction and speech separation | Aug 15, 2022 | Speaker Verification, Speech Extraction | Code Available 0
Recycling an anechoic pre-trained speech separation deep neural network for binaural dereverberation of a single source | Aug 9, 2022 | Speech Separation | Unverified 0
Conv-NILM-Net, a causal and multi-appliance model for energy source separation | Aug 3, 2022 | Non-Intrusive Load Monitoring, Speech Separation | Unverified 0
ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding | Jul 19, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 0
SATTS: Speaker Attractor Text to Speech, Learning to Speak by Learning to Separate | Jul 13, 2022 | Speech Separation, text-to-speech | Unverified 0
Dual-Path Cross-Modal Attention for better Audio-Visual Speech Extraction | Jul 9, 2022 | Speech Extraction, Speech Separation | Unverified 0
Multi-Modal Multi-Correlation Learning for Audio-Visual Speech Separation | Jul 4, 2022 | Contrastive Learning, Speech Separation | Unverified 0
Efficient Transformer-based Speech Enhancement Using Long Frames and STFT Magnitudes | Jun 23, 2022 | Speech Enhancement, Speech Separation | Unverified 0
Resource-Efficient Separation Transformer | Jun 19, 2022 | Speech Separation | Code Available 0
Simultaneous Speech Extraction for Multiple Target Speakers under the Meeting Scenarios | Jun 17, 2022 | Action Detection, Activity Detection | Unverified 0
AmbiSep: Ambisonic-to-Ambisonic Reverberant Speech Separation Using Transformer Networks | Jun 13, 2022 | Speech Separation | Unverified 0
Conversational Speech Separation: an Evaluation Study for Streaming Applications | May 31, 2022 | Speech Separation | Unverified 0
An enhanced Conv-TasNet model for speech separation using a speaker distance-based loss function | May 26, 2022 | Speech Separation | Code Available 0
SepIt: Approaching a Single Channel Speech Separation Bound | May 24, 2022 | Audio Source Separation, Generalization Bounds | Unverified 0
MESH2IR: Neural Acoustic Impulse Response Generator for Complex 3D Scenes | May 18, 2022 | 2k, CPU | Code Available 1
Separator-Transducer-Segmenter: Streaming Recognition and Segmentation of Multi-party Speech | May 10, 2022 | Segmentation, speech-recognition | Unverified 0