Dynamic Slimmable Networks for Efficient Speech Separation Jul 8, 2025 Speech Separation
— Unverified 0Improving Practical Aspects of End-to-End Multi-Talker Speech Recognition for Online and Offline Scenarios Jun 17, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline May 25, 2025 Speech Extraction Speech Separation
Code Code Available 3Attractor-Based Speech Separation of Multiple Utterances by Unknown Number of Speakers May 22, 2025 Speech Separation
— Unverified 0Single-Channel Target Speech Extraction Utilizing Distance and Room Clues May 20, 2025 Speech Extraction Speech Separation
— Unverified 0Time-Frequency-Based Attention Cache Memory Model for Real-Time Speech Separation May 19, 2025 Speech Separation
— Unverified 0SepPrune: Structured Pruning for Efficient Deep Speech Separation May 17, 2025 channel selection Computational Efficiency
Code Code Available 1A Survey of Deep Learning for Complex Speech Spectrograms May 13, 2025 Deep Learning Speech Enhancement
— Unverified 0ArrayDPS: Unsupervised Blind Speech Separation with a Diffusion Prior May 8, 2025 Room Impulse Response (RIR) Speech Separation
Code Code Available 1SwinLip: An Efficient Visual Speech Encoder for Lip Reading Using Swin Transformer May 7, 2025 Audio-Visual Speech Recognition Lip Reading
— Unverified 0SepALM: Audio Language Models Are Error Correctors for Robust Speech Separation May 6, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Passive Underwater Acoustic Signal Separation based on Feature Decoupling Dual-path Network Apr 11, 2025 Speech Separation
— Unverified 0Causal Self-supervised Pretrained Frontend with Predictive Code for Speech Separation Apr 3, 2025 Decoder Knowledge Distillation
— Unverified 0VANPY: Voice Analysis Framework Feb 17, 2025 Action Detection Activity Detection
Code Code Available 1EDSep: An Effective Diffusion-Based Method for Speech Source Separation Jan 27, 2025 Speech Separation
— Unverified 0Leveraging Spatial Cues from Cochlear Implant Microphones to Efficiently Enhance Speech Separation in Real-World Listening Scenes Jan 24, 2025 Speech Separation
— Unverified 0Beyond Speaker Identity: Text Guided Target Speech Extraction Jan 15, 2025 Speech Extraction Speech Separation
Code Code Available 0Reading to Listen at the Cocktail Party: Multi-Modal Speech Separation Jan 2, 2025 Sentence Speech Separation
— Unverified 0U-Mamba-Net: A highly efficient Mamba-based U-net style network for noisy and reverberant speech separation Dec 24, 2024 feature selection Mamba
— Unverified 0Speech Separation using Neural Audio Codecs with Embedding Loss Nov 27, 2024 Speech Separation
— Unverified 0Multiple Choice Learning for Efficient Speech Separation with Many Speakers Nov 27, 2024 Multiple-choice Speech Separation
— Unverified 0Study of the Performance of CEEMDAN in Underdetermined Speech Separation Nov 18, 2024 Audio Source Separation Speech Separation
— Unverified 0DCF-DS: Deep Cascade Fusion of Diarization and Separation for Speech Recognition under Realistic Single-Channel Conditions Nov 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Speech Separation with Pretrained Frontend to Minimize Domain Mismatch Nov 5, 2024 Speech Separation
Code Code Available 0Task-Aware Unified Source Separation Oct 31, 2024 Audio Source Separation Music Source Separation
— Unverified 0SepMamba: State-space models for speaker separation using Mamba Oct 28, 2024 Mamba Speaker Separation
Code Code Available 1Mask-Weighted Spatial Likelihood Coding for Speaker-Independent Joint Localization and Mask Estimation Oct 25, 2024 Sound Source Localization Speech Separation
— Unverified 0STCON System for the CHiME-8 Challenge Oct 17, 2024 Data Augmentation Speech Separation
— Unverified 0TIGER: Time-frequency Interleaved Gain Extraction and Reconstruction for Efficient Speech Separation Oct 2, 2024 Speech Separation
— Unverified 0SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios Oct 2, 2024 Speech Enhancement Speech Separation
Code Code Available 3Wanna hear your voice? A sample is all we need! Oct 1, 2024 All Speech Separation
— Unverified 0Incorporating Spatial Cues in Modular Speaker Diarization for Multi-channel Multi-party Meetings Sep 25, 2024 Clustering speaker-diarization
— Unverified 0Target Speaker ASR with Whisper Sep 14, 2024 Speech Separation
Code Code Available 2DualSep: A Light-weight dual-encoder convolutional recurrent network for real-time in-car speech separation Sep 13, 2024 CPU Speech Separation
— Unverified 0USEF-TSE: Universal Speaker Embedding Free Target Speaker Extraction Sep 4, 2024 Speaker Recognition Speech Separation
Code Code Available 1LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization Sep 1, 2024 speaker-diarization Speaker Diarization
— Unverified 0Improving Generalization of Speech Separation in Real-World Scenarios: Strategies in Simulation, Optimization, and Evaluation Aug 28, 2024 Speech Separation
— Unverified 0Enhanced Reverberation as Supervision for Unsupervised Speech Separation Aug 6, 2024 Speech Separation
Code Code Available 1TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement Aug 6, 2024 Speech Enhancement Speech Separation
Code Code Available 2Annealed Multiple Choice Learning: Overcoming limitations of Winner-takes-all with annealing Jul 22, 2024 All Diversity
Code Code Available 1Robustness of Speech Separation Models for Similar-pitch Speakers Jul 22, 2024 speech-recognition Speech Recognition
— Unverified 0TalTech-IRIT-LIS Speaker and Language Diarization Systems for DISPLACE 2024 Jul 17, 2024 speaker-diarization Speaker Diarization
— Unverified 0Speech Slytherin: Examining the Performance and Efficiency of Mamba for Speech Separation, Recognition, and Synthesis Jul 13, 2024 Mamba speech-recognition
Code Code Available 2Knowledge boosting during low-latency inference Jul 9, 2024 Speech Separation
Code Code Available 0Audio-Visual Approach For Multimodal Concurrent Speaker Detection Jul 1, 2024 Multimodal Deep Learning speaker-diarization
— Unverified 0Papez: Resource-Efficient Speech Separation with Auditory Working Memory Jul 1, 2024 Speech Separation
Code Code Available 1Towards Audio Codec-based Speech Separation Jun 18, 2024 Edge-computing Speech Separation
Code Code Available 1Text-aware Speech Separation for Multi-talker Keyword Spotting Jun 18, 2024 Keyword Spotting Speech Separation
Code Code Available 1AV-CrossNet: an Audiovisual Complex Spectral Mapping Network for Speech Separation By Leveraging Narrow- and Cross-Band Modeling Jun 17, 2024 Speaker Separation Speech Enhancement
Code Code Available 1Enhanced Deep Speech Separation in Clustered Ad Hoc Distributed Microphone Environments Jun 14, 2024 Deep Learning Speech Separation
— Unverified 0