Title | Date | Tasks | Code
Autoregressive Speech Enhancement via Acoustic Tokens | Jul 17, 2025 | Speech Enhancement | Unverified, 0
P.808 Multilingual Speech Enhancement Testing: Approach and Results of URGENT 2025 Challenge | Jul 15, 2025 | Speech Enhancement, text-to-speech | Unverified, 0
Speech Quality Assessment Model Based on Mixture of Experts: System-Level Performance Enhancement and Utterance-Level Challenge Analysis | Jul 8, 2025 | Data Augmentation, Mixture-of-Experts | Unverified, 0
Robust One-step Speech Enhancement via Consistency Distillation | Jul 8, 2025 | Speech Enhancement | Code Available, 1
MambAttention: Mamba with Multi-Head Attention for Generalizable Single-Channel Speech Enhancement | Jul 1, 2025 | Automatic Speech Recognition, Mamba | Code Available, 2
Frequency-Weighted Training Losses for Phoneme-Level DNN-based Speech Enhancement | Jun 23, 2025 | Speech Enhancement | Unverified, 0
EDNet: A Distortion-Agnostic Speech Enhancement Framework with Gating Mamba Mechanism and Phase Shift-Invariant Training | Jun 19, 2025 | Bandwidth Extension, Denoising | Unverified, 0
A Comparative Evaluation of Deep Learning Models for Speech Enhancement in Real-World Noisy Environments | Jun 17, 2025 | Denoising, Speaker Recognition | Unverified, 0
Efficient Speech Enhancement via Embeddings from Pre-trained Generative Audioencoders | Jun 13, 2025 | Speech Enhancement | Code Available, 2
Exploring Length Generalization For Transformer-based Speech Enhancement | Jun 7, 2025 | Speech Enhancement | Unverified, 0
French Listening Tests for the Assessment of Intelligibility, Quality, and Identity of Body-Conducted Speech Enhancement | Jun 4, 2025 | Bandwidth Extension, Speaker Identification | Unverified, 0
Diffusion Buffer: Online Diffusion-based Speech Enhancement with Sub-Second Latency | Jun 3, 2025 | GPU, Speech Enhancement | Unverified, 0
Lessons Learned from the URGENT 2024 Speech Enhancement Challenge | Jun 2, 2025 | Speech Enhancement | Code Available, 0
A Two-Stage Hierarchical Deep Filtering Framework for Real-Time Speech Enhancement | Jun 1, 2025 | Speech Enhancement | Unverified, 0
A Composite Predictive-Generative Approach to Monaural Universal Speech Enhancement | May 30, 2025 | Denoising, Speech Enhancement | Unverified, 0
Boosting Domain Incremental Learning: Selecting the Optimal Parameters is All You Need | May 29, 2025 | All, image-classification | Code Available, 0
Interspeech 2025 URGENT Speech Enhancement Challenge | May 29, 2025 | Diversity, Speech Enhancement | Unverified, 0
DeepFilterGAN: A Full-band Real-time Speech Enhancement System with GAN-based Stochastic Regeneration | May 29, 2025 | Speech Enhancement | Unverified, 0
ARiSE: Auto-Regressive Multi-Channel Speech Enhancement | May 28, 2025 | Speech Enhancement | Unverified, 0
Study of Lightweight Transformer Architectures for Single-Channel Speech Enhancement | May 27, 2025 | Speech Enhancement | Unverified, 0
Model as Loss: A Self-Consistent Training Paradigm | May 27, 2025 | Decoder, Speech Enhancement | Unverified, 0
Stack Less, Repeat More: A Block Reusing Approach for Progressive Speech Enhancement | May 26, 2025 | Decoder, Speech Enhancement | Unverified, 0
A Lightweight Hybrid Dual Channel Speech Enhancement System under Low-SNR Conditions | May 26, 2025 | Speech Enhancement | Code Available, 2
FlowSE: Efficient and High-Quality Speech Enhancement via Flow Matching | May 26, 2025 | Quantization, Speech Enhancement | Code Available, 2
Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement | May 26, 2025 | Speech Enhancement | Code Available, 1
Training-Free Multi-Step Audio Source Separation | May 26, 2025 | Audio Source Separation, Denoising | Code Available, 2
TS-URGENet: A Three-stage Universal Robust and Generalizable Speech Enhancement Network | May 24, 2025 | Speech Enhancement | Unverified, 0
Active Speech Enhancement: Active Speech Denoising Decliping and Deveraberation | May 22, 2025 | Denoising, Mamba | Unverified, 0
Improving Noise Robustness of LLM-based Zero-shot TTS via Discrete Acoustic Token Denoising | May 20, 2025 | Decoder, Denoising | Unverified, 0
A Semantic Information-based Hierarchical Speech Enhancement Method Using Factorized Codec and Diffusion Model | May 20, 2025 | Speech Enhancement | Unverified, 0
MDDM: A Multi-view Discriminative Enhanced Diffusion-based Model for Speech Enhancement | May 19, 2025 | Speech Enhancement | Unverified, 0
RoVo: Robust Voice Protection Against Unauthorized Speech Synthesis with Embedding-Level Perturbations | May 19, 2025 | Speaker Verification, Speech Enhancement | Unverified, 0
Unified Architecture and Unsupervised Speech Disentanglement for Speaker Embedding-Free Enrollment in Personalized Speech Enhancement | May 18, 2025 | Disentanglement, Speech Enhancement | Unverified, 0
A Survey of Deep Learning for Complex Speech Spectrograms | May 13, 2025 | Deep Learning, Speech Enhancement | Unverified, 0
Normalize Everything: A Preconditioned Magnitude-Preserving Architecture for Diffusion-Based Speech Enhancement | May 8, 2025 | Image Generation, Speech Enhancement | Unverified, 0
Robust Speech Recognition with Schrödinger Bridge-Based Speech Enhancement | May 7, 2025 | Robust Speech Recognition, Speech Enhancement | Unverified, 0
SwinLip: An Efficient Visual Speech Encoder for Lip Reading Using Swin Transformer | May 7, 2025 | Audio-Visual Speech Recognition, Lip Reading | Unverified, 0
CoGenAV: Versatile Audio-Visual Representation Learning via Contrastive-Generative Synchronization | May 6, 2025 | Active Speaker Detection, Audio-Visual Speech Recognition | Code Available, 2
How much to Dereverberate? Low-Latency Single-Channel Speech Enhancement in Distant Microphone Scenarios | May 2, 2025 | Speech Enhancement | Unverified, 0
Predicting speech intelligibility in older adults using the Gammachirp Envelope Similarity Index, GESI | Apr 20, 2025 | Speech Enhancement | Unverified, 0
DiTSE: High-Fidelity Generative Speech Enhancement via Latent Diffusion Transformers | Apr 13, 2025 | Hallucination, Speech Enhancement | Unverified, 0
Spatial-Filter-Bank-Based Neural Method for Multichannel Speech Enhancement | Apr 2, 2025 | Speech Enhancement | Unverified, 0
Magnitude-Phase Dual-Path Speech Enhancement Network based on Self-Supervised Embedding and Perceptual Contrast Stretch Boosting | Mar 27, 2025 | Self-Supervised Learning, Speech Enhancement | Code Available, 0
A Low-Power Streaming Speech Enhancement Accelerator For Edge Devices | Mar 27, 2025 | Model Compression, Speech Enhancement | Unverified, 0
Joint Spectrogram Separation and TDOA Estimation using Optimal Transport | Mar 24, 2025 | blind source separation, Speech Enhancement | Unverified, 0
HiFi-Stream: Streaming Speech Enhancement with Generative Adversarial Networks | Mar 21, 2025 | Speech Enhancement | Code Available, 1
A Speech Production Model for Radar: Connecting Speech Acoustics with Radar-Measured Vibrations | Mar 19, 2025 | Speech Enhancement | Unverified, 0
Variational Autoencoder for Personalized Pathological Speech Enhancement | Mar 18, 2025 | Speech Enhancement | Unverified, 0
Room Impulse Response Estimation through Optimal Mass Transport Barycenters | Mar 18, 2025 | Speech Enhancement | Code Available, 0
FNSE-SBGAN: Far-field Speech Enhancement with Schrodinger Bridge and Generative Adversarial Networks | Mar 17, 2025 | Speech Enhancement | Code Available, 1