Transcription-Free Fine-Tuning of Speech Separation Models for Noisy and Reverberant Multi-Speaker Automatic Speech Recognition Jun 13, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Multimodal Representation Loss Between Timed Text and Audio for Regularized Speech Separation Jun 12, 2024 Language Modeling Language Modelling
— Unverified 0Noise-robust Speech Separation with Fast Generative Correction Jun 11, 2024 Speech Separation
Code Code Available 1Separate and Reconstruct: Asymmetric Encoder-Decoder for Speech Separation Jun 10, 2024 Chunking Speech Separation
Code Code Available 3Cross-Talk Reduction May 30, 2024 Speech Separation
— Unverified 0Effects of Dataset Sampling Rate for Noise Cancellation through Deep Learning May 30, 2024 Speech Separation
— Unverified 0SPMamba: State-space model is all you need in speech separation Apr 2, 2024 All Mamba
Code Code Available 3Robust Active Speaker Detection in Noisy Environments Mar 27, 2024 Active Speaker Detection Speech Separation
— Unverified 0Dual-path Mamba: Short and Long-term Bidirectional Selective Structured State Space Models for Speech Separation Mar 27, 2024 Mamba Speech Separation
Code Code Available 2PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings Mar 4, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 2Probing Self-supervised Learning Models with Target Speech Extraction Feb 17, 2024 Self-Supervised Learning Speaker Identification
— Unverified 0Mixture to Mixture: Leveraging Close-talk Mixtures as Weak-supervision for Speech Separation Feb 14, 2024 Speaker Separation Speech Separation
— Unverified 0Online speaker diarization of meetings guided by speech separation Jan 30, 2024 Action Detection Activity Detection
Code Code Available 1TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion Jan 25, 2024 speech-recognition Speech Recognition
Code Code Available 1Boosting Unknown-number Speaker Separation with Transformer Decoder-based Attractor Jan 23, 2024 Decoder Speaker Separation
— Unverified 0Resource-constrained stereo singing voice cancellation Jan 22, 2024 Music Source Separation Speech Separation
— Unverified 0Multi-Input Multi-Output Target-Speaker Voice Activity Detection For Unified, Flexible, and Robust Audio-Visual Speaker Diarization Jan 16, 2024 Action Detection Activity Detection
— Unverified 0Single-Microphone Speaker Separation and Voice Activity Detection in Noisy and Reverberant Environments Jan 7, 2024 Action Detection Activity Detection
— Unverified 0Hyperbolic Distance-Based Speech Separation Jan 7, 2024 Speech Separation
— Unverified 0Improving Label Assignments Learning by Dynamic Sample Dropout Combined with Layer-wise Optimization in Speech Separation Nov 20, 2023 Speech Separation
— Unverified 0Seeing Through the Conversation: Audio-Visual Speech Separation based on Diffusion Model Oct 30, 2023 Speech Separation
— Unverified 0Real-time Speech Enhancement and Separation with a Unified Deep Neural Network for Single/Dual Talker Scenarios Oct 16, 2023 Speech Enhancement Speech Separation
— Unverified 0A Single Speech Enhancement Model Unifying Dereverberation, Denoising, Speaker Counting, Separation, and Extraction Oct 12, 2023 Denoising Speech Enhancement
— Unverified 0On Time Domain Conformer Models for Monaural Speech Separation in Noisy Reverberant Acoustic Environments Oct 9, 2023 Computational Efficiency Speech Separation
Code Code Available 1GASS: Generalizing Audio Source Separation with Large-scale Data Sep 29, 2023 Audio Source Separation Speech Separation
— Unverified 0RTFS-Net: Recurrent Time-Frequency Modelling for Efficient Audio-Visual Speech Separation Sep 29, 2023 Audio-Visual Speech Recognition speech-recognition
Code Code Available 1Meeting Recognition with Continuous Speech Separation and Transcription-Supported Diarization Sep 28, 2023 Sentence Speech Separation
— Unverified 0SPGM: Prioritizing Local Features for enhanced speech separation performance Sep 22, 2023 Speech Separation
Code Code Available 0Combining TF-GridNet and Mixture Encoder for Continuous Speech Separation for Meeting Transcription Sep 15, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition Aug 21, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0IIANet: An Intra- and Inter-Modality Attention Network for Audio-Visual Speech Separation Aug 16, 2023 Speech Separation
Code Code Available 1Improving Deep Attractor Network by BGRU and GMM for Speech Separation Aug 7, 2023 Speech Separation
— Unverified 0Monaural Multi-Speaker Speech Separation Using Efficient Transformer Model Jul 29, 2023 Computational Efficiency Speech Separation
— Unverified 0Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation Jul 23, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Audio-visual End-to-end Multi-channel Speech Separation, Dereverberation and Recognition Jul 6, 2023 Speech Dereverberation Speech Enhancement
— Unverified 0Enhanced Neural Beamformer with Spatial Information for Target Speech Extraction Jun 28, 2023 Dimensionality Reduction Speech Extraction
— Unverified 0Mixture Encoder for Joint Speech Separation and Recognition Jun 21, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Multi-Loss Convolutional Network with Time-Frequency Attention for Speech Enhancement Jun 15, 2023 Speech Enhancement Speech Separation
— Unverified 0An Efficient Speech Separation Network Based on Recurrent Fusion Dilated Convolution and Channel Attention Jun 9, 2023 Computational Efficiency Decoder
— Unverified 0Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Model May 31, 2023 Speech Separation
Code Code Available 1UNSSOR: Unsupervised Neural Speech Separation by Leveraging Over-determined Training Mixtures May 31, 2023 Speaker Separation Speech Separation
— Unverified 0An Experimental Review of Speaker Diarization methods with application to Two-Speaker Conversational Telephone Speech recordings May 29, 2023 Clustering speaker-diarization
— Unverified 0A Neural State-Space Model Approach to Efficient Speech Separation May 26, 2023 Representation Learning Speech Separation
Code Code Available 1Locate and Beamform: Two-dimensional Locating All-neural Beamformer for Multi-channel Speech Separation May 18, 2023 All Speech Separation
— Unverified 0Speech Separation based on Contrastive Learning and Deep Modularization May 18, 2023 Contrastive Learning Self-Supervised Learning
— Unverified 0Diffusion-based Signal Refiner for Speech Separation May 10, 2023 Denoising Speech Enhancement
— Unverified 0AudioSlots: A slot-centric generative model for audio separation May 9, 2023 blind source separation Decoder
— Unverified 0Deep Learning for Joint Acoustic Echo and Acoustic Howling Suppression in Hybrid Meetings May 2, 2023 Speech Separation
— Unverified 0Multi-channel Speech Separation Using Spatially Selective Deep Non-linear Filters Apr 24, 2023 Speech Separation
— Unverified 0On Data Sampling Strategies for Training Neural Network Speech Separation Models Apr 14, 2023 Speech Separation
— Unverified 0