AVA-ActiveSpeaker: An Audio-Visual Dataset for Active Speaker Detection Jan 5, 2019 Active Speaker Detection Audio-Visual Active Speaker Detection
Code Code Available 1Perceptual Losses for Real-Time Style Transfer and Super-Resolution Mar 27, 2016 Image Super-Resolution Nuclear Segmentation
Code Code Available 1Autoregressive Speech Enhancement via Acoustic Tokens Jul 17, 2025 Speech Enhancement
— Unverified 0P.808 Multilingual Speech Enhancement Testing: Approach and Results of URGENT 2025 Challenge Jul 15, 2025 Speech Enhancement text-to-speech
— Unverified 0Speech Quality Assessment Model Based on Mixture of Experts: System-Level Performance Enhancement and Utterance-Level Challenge Analysis Jul 8, 2025 Data Augmentation Mixture-of-Experts
— Unverified 0Frequency-Weighted Training Losses for Phoneme-Level DNN-based Speech Enhancement Jun 23, 2025 Speech Enhancement
— Unverified 0EDNet: A Distortion-Agnostic Speech Enhancement Framework with Gating Mamba Mechanism and Phase Shift-Invariant Training Jun 19, 2025 Bandwidth Extension Denoising
— Unverified 0A Comparative Evaluation of Deep Learning Models for Speech Enhancement in Real-World Noisy Environments Jun 17, 2025 Denoising Speaker Recognition
— Unverified 0Exploring Length Generalization For Transformer-based Speech Enhancement Jun 7, 2025 Speech Enhancement
— Unverified 0French Listening Tests for the Assessment of Intelligibility, Quality, and Identity of Body-Conducted Speech Enhancement Jun 4, 2025 Bandwidth Extension Speaker Identification
— Unverified 0Diffusion Buffer: Online Diffusion-based Speech Enhancement with Sub-Second Latency Jun 3, 2025 GPU Speech Enhancement
— Unverified 0Lessons Learned from the URGENT 2024 Speech Enhancement Challenge Jun 2, 2025 Speech Enhancement
Code Code Available 0A Two-Stage Hierarchical Deep Filtering Framework for Real-Time Speech Enhancement Jun 1, 2025 Speech Enhancement
— Unverified 0A Composite Predictive-Generative Approach to Monaural Universal Speech Enhancement May 30, 2025 Denoising Speech Enhancement
— Unverified 0DeepFilterGAN: A Full-band Real-time Speech Enhancement System with GAN-based Stochastic Regeneration May 29, 2025 Speech Enhancement
— Unverified 0Interspeech 2025 URGENT Speech Enhancement Challenge May 29, 2025 Diversity Speech Enhancement
— Unverified 0Boosting Domain Incremental Learning: Selecting the Optimal Parameters is All You Need May 29, 2025 All image-classification
Code Code Available 0ARiSE: Auto-Regressive Multi-Channel Speech Enhancement May 28, 2025 Speech Enhancement
— Unverified 0Study of Lightweight Transformer Architectures for Single-Channel Speech Enhancement May 27, 2025 Speech Enhancement
— Unverified 0Model as Loss: A Self-Consistent Training Paradigm May 27, 2025 Decoder Speech Enhancement
— Unverified 0Stack Less, Repeat More: A Block Reusing Approach for Progressive Speech Enhancement May 26, 2025 Decoder Speech Enhancement
— Unverified 0TS-URGENet: A Three-stage Universal Robust and Generalizable Speech Enhancement Network May 24, 2025 Speech Enhancement
— Unverified 0Active Speech Enhancement: Active Speech Denoising Decliping and Deveraberation May 22, 2025 Denoising Mamba
— Unverified 0Improving Noise Robustness of LLM-based Zero-shot TTS via Discrete Acoustic Token Denoising May 20, 2025 Decoder Denoising
— Unverified 0A Semantic Information-based Hierarchical Speech Enhancement Method Using Factorized Codec and Diffusion Model May 20, 2025 Speech Enhancement
— Unverified 0MDDM: A Multi-view Discriminative Enhanced Diffusion-based Model for Speech Enhancement May 19, 2025 Speech Enhancement
— Unverified 0RoVo: Robust Voice Protection Against Unauthorized Speech Synthesis with Embedding-Level Perturbations May 19, 2025 Speaker Verification Speech Enhancement
— Unverified 0Unified Architecture and Unsupervised Speech Disentanglement for Speaker Embedding-Free Enrollment in Personalized Speech Enhancement May 18, 2025 Disentanglement Speech Enhancement
— Unverified 0A Survey of Deep Learning for Complex Speech Spectrograms May 13, 2025 Deep Learning Speech Enhancement
— Unverified 0Normalize Everything: A Preconditioned Magnitude-Preserving Architecture for Diffusion-Based Speech Enhancement May 8, 2025 Image Generation Speech Enhancement
— Unverified 0Robust Speech Recognition with Schrödinger Bridge-Based Speech Enhancement May 7, 2025 Robust Speech Recognition Speech Enhancement
— Unverified 0SwinLip: An Efficient Visual Speech Encoder for Lip Reading Using Swin Transformer May 7, 2025 Audio-Visual Speech Recognition Lip Reading
— Unverified 0How much to Dereverberate? Low-Latency Single-Channel Speech Enhancement in Distant Microphone Scenarios May 2, 2025 Speech Enhancement
— Unverified 0Predicting speech intelligibility in older adults using the Gammachirp Envelope Similarity Index, GESI Apr 20, 2025 Speech Enhancement
— Unverified 0DiTSE: High-Fidelity Generative Speech Enhancement via Latent Diffusion Transformers Apr 13, 2025 Hallucination Speech Enhancement
— Unverified 0Spatial-Filter-Bank-Based Neural Method for Multichannel Speech Enhancement Apr 2, 2025 Speech Enhancement
— Unverified 0Magnitude-Phase Dual-Path Speech Enhancement Network based on Self-Supervised Embedding and Perceptual Contrast Stretch Boosting Mar 27, 2025 Self-Supervised Learning Speech Enhancement
Code Code Available 0A Low-Power Streaming Speech Enhancement Accelerator For Edge Devices Mar 27, 2025 Model Compression Speech Enhancement
— Unverified 0Joint Spectrogram Separation and TDOA Estimation using Optimal Transport Mar 24, 2025 blind source separation Speech Enhancement
— Unverified 0A Speech Production Model for Radar: Connecting Speech Acoustics with Radar-Measured Vibrations Mar 19, 2025 Speech Enhancement
— Unverified 0Room Impulse Response Estimation through Optimal Mass Transport Barycenters Mar 18, 2025 Speech Enhancement
Code Code Available 0Variational Autoencoder for Personalized Pathological Speech Enhancement Mar 18, 2025 Speech Enhancement
— Unverified 0Linguistic Knowledge Transfer Learning for Speech Enhancement Mar 10, 2025 Speech Enhancement Transfer Learning
— Unverified 0ProSE: Diffusion Priors for Speech Enhancement Mar 9, 2025 Denoising regression
— Unverified 0Enhancing Speech Quality through the Integration of BGRU and Transformer Architectures Feb 25, 2025 Speech Enhancement
— Unverified 0Speech Enhancement Using Continuous Embeddings of Neural Audio Codec Feb 22, 2025 Quantization Speech Enhancement
— Unverified 0LMFCA-Net: A Lightweight Model for Multi-Channel Speech Enhancement with Efficient Narrow-Band and Cross-Band Attention Feb 17, 2025 Speech Enhancement
— Unverified 0TAPS: Throat and Acoustic Paired Speech Dataset for Deep Learning-Based Speech Enhancement Feb 17, 2025 Speech Enhancement
— Unverified 0Microphone Array Geometry Independent Multi-Talker Distant ASR: NTT System for the DASR Task of the CHiME-8 Challenge Feb 14, 2025 Action Detection Activity Detection
— Unverified 0Advances in Microphone Array Processing and Multichannel Speech Enhancement Feb 13, 2025 Speech Enhancement
— Unverified 0