Task-Aware Unified Source Separation Oct 31, 2024 Audio Source Separation Music Source Separation
— Unverified 0Run-Time Adaptation of Neural Beamforming for Robust Speech Dereverberation and Denoising Oct 30, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Simultaneous Diarization and Separation of Meetings through the Integration of Statistical Mixture Models Oct 28, 2024 Speech Enhancement
— Unverified 0ELAICHI: Enhancing Low-resource TTS by Addressing Infrequent and Low-frequency Character Bigrams Oct 23, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Using RLHF to align speech enhancement approaches to mean-opinion quality scores Oct 17, 2024 Speech Enhancement
— Unverified 0GAN-Based Speech Enhancement for Low SNR Using Latent Feature Conditioning Oct 17, 2024 Generative Adversarial Network Speech Enhancement
— Unverified 0FINALLY: fast and universal speech enhancement with studio-like quality Oct 8, 2024 Speech Enhancement
— Unverified 0Towards Ultra-Low-Power Neuromorphic Speech Enhancement with Spiking-FullSubNet Oct 7, 2024 Denoising Speech Denoising
Code Code Available 2RelUNet: Relative Channel Fusion U-Net for Multichannel Speech Enhancement Oct 7, 2024 Speech Enhancement
— Unverified 0Diffusion-based Unsupervised Audio-visual Speech Enhancement Oct 4, 2024 Speech Enhancement
— Unverified 0Restorative Speech Enhancement: A Progressive Approach Using SE and Codec Modules Oct 2, 2024 Quantization Speech Enhancement
— Unverified 0SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios Oct 2, 2024 Speech Enhancement Speech Separation
Code Code Available 3Advanced Clustering Techniques for Speech Signal Enhancement: A Review and Metanalysis of Fuzzy C-Means, K-Means, and Kernel Fuzzy C-Means Methods Sep 28, 2024 Clustering Speech Enhancement
— Unverified 0Speech Boosting: Low-Latency Live Speech Enhancement for TWS Earbuds Sep 27, 2024 Speech Enhancement
— Unverified 0Towards Sub-millisecond Latency Real-Time Speech Enhancement Models on Hearables Sep 26, 2024 Speech Enhancement
— Unverified 0MC-SEMamba: A Simple Multi-channel Extension of SEMamba Sep 26, 2024 Mamba Speech Enhancement
— Unverified 0An Explicit Consistency-Preserving Loss Function for Phase Reconstruction and Speech Enhancement Sep 24, 2024 Speech Enhancement
— Unverified 0Robust Audio-Visual Speech Enhancement: Correcting Misassignments in Complex Environments with Advanced Post-Processing Sep 22, 2024 Speech Enhancement
— Unverified 0Self-Supervised Audio-Visual Soundscape Stylization Sep 22, 2024 Speech Enhancement
— Unverified 0LiSenNet: Lightweight Sub-band and Dual-Path Modeling for Real-Time Speech Enhancement Sep 20, 2024 Speech Enhancement
Code Code Available 2Geometry-Constrained EEG Channel Selection for Brain-Assisted Speech Enhancement Sep 19, 2024 channel selection EEG
— Unverified 0Speech-Declipping Transformer with Complex Spectrogram and Learnerble Temporal Features Sep 19, 2024 Speech Enhancement
— Unverified 0A Lightweight and Real-Time Binaural Speech Enhancement Model with Spatial Cues Preservation Sep 19, 2024 Speech Enhancement
Code Code Available 1Dense-TSNet: Dense Connected Two-Stage Structure for Ultra-Lightweight Speech Enhancement Sep 18, 2024 Mamba Speech Enhancement
— Unverified 0High-Resolution Speech Restoration with Latent Diffusion Model Sep 17, 2024 model Speech Enhancement
Code Code Available 0TCG CREST System Description for the Second DISPLACE Challenge Sep 16, 2024 Action Detection Activity Detection
— Unverified 0Investigating Training Objectives for Generative Speech Enhancement Sep 16, 2024 Speech Enhancement
Code Code Available 0Leveraging Joint Spectral and Spatial Learning with MAMBA for Multichannel Speech Enhancement Sep 16, 2024 Mamba Speech Enhancement
— Unverified 0Ultra-Low Latency Speech Enhancement - A Comprehensive Study Sep 16, 2024 Mamba Speech Enhancement
— Unverified 0Apollo: Band-sequence Modeling for High-Quality Audio Restoration Sep 13, 2024 Computational Efficiency Speech Enhancement
Code Code Available 3Rethinking Mamba in Speech Processing by Self-Supervised Models Sep 11, 2024 Mamba Speech Enhancement
— Unverified 0DeWinder: Single-Channel Wind Noise Reduction using Ultrasound Sensing Sep 10, 2024 Speech Enhancement
— Unverified 0IndicVoices-R: Unlocking a Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS Sep 9, 2024 Denoising Speech Enhancement
Code Code Available 2TF-Mamba: A Time-Frequency Network for Sound Source Localization Sep 8, 2024 Mamba Sound Source Localization
— Unverified 0Diffusion-based Speech Enhancement with Schrödinger Bridge and Symmetric Noise Schedule Sep 8, 2024 Speech Enhancement
— Unverified 0aTENNuate: Optimized Real-time Speech Enhancement with Deep SSMs on Raw Audio Sep 5, 2024 Audio Denoising Denoising
— Unverified 0LSTMSE-Net: Long Short Term Speech Enhancement Network for Audio-visual Speech Enhancement Sep 3, 2024 Decoder Speech Enhancement
Code Code Available 1Effective Noise-aware Data Simulation for Domain-adaptive Speech Enhancement Leveraging Dynamic Stochastic Perturbation Sep 3, 2024 Speech Enhancement
Code Code Available 0Progressive Residual Extraction based Pre-training for Speech Representation Learning Aug 31, 2024 Emotion Recognition Representation Learning
— Unverified 0Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement Aug 30, 2024 Decoder Speech Enhancement
Code Code Available 1Spectral Masking with Explicit Time-Context Windowing for Neural Network-Based Monaural Speech Enhancement Aug 28, 2024 Speech Enhancement
— Unverified 0Dynamic Gated Recurrent Neural Network for Compute-efficient Speech Enhancement Aug 22, 2024 Speech Enhancement
— Unverified 0DPSNN: Spiking Neural Network for Low-Latency Streaming Speech Enhancement Aug 14, 2024 Automatic Speech Recognition Speech Enhancement
— Unverified 0Direction of Arrival Correction through Speech Quality Feedback Aug 13, 2024 Speech Enhancement
Code Code Available 0Heterogeneous Space Fusion and Dual-Dimension Attention: A New Paradigm for Speech Enhancement Aug 13, 2024 Self-Supervised Learning Speech Enhancement
— Unverified 0BSS-CFFMA: Cross-Domain Feature Fusion and Multi-Attention Speech Enhancement Network based on Self-Supervised Embedding Aug 13, 2024 Denoising Self-Supervised Learning
Code Code Available 0One-Shot Distributed Node-Specific Signal Estimation with Non-Overlapping Latent Subspaces in Acoustic Sensor Networks Aug 7, 2024 Speech Enhancement
— Unverified 0TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement Aug 6, 2024 Speech Enhancement Speech Separation
Code Code Available 2ctPuLSE: Close-Talk, and Pseudo-Label Based Far-Field, Speech Enhancement Jul 28, 2024 Pseudo Label Speech Enhancement
— Unverified 0Speech Bandwidth Expansion Via High Fidelity Generative Adversarial Networks Jul 26, 2024 Generative Adversarial Network Speech Enhancement
— Unverified 0