Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses Feb 3, 2021 Decoder Speech Denoising
Code Code Available 2Real Time Speech Enhancement in the Waveform Domain Jun 23, 2020 CPU Data Augmentation
Code Code Available 2VoiceFilter: Targeted Voice Separation by Speaker-Conditioned Spectrogram Masking Oct 11, 2018 Speaker Recognition Speaker Separation
Code Code Available 2Proximal Policy Optimization Algorithms Jul 20, 2017 Continuous Control Dota 2
Code Code Available 2SEGAN: Speech Enhancement Generative Adversarial Network Mar 28, 2017 Generative Adversarial Network Speech Enhancement
Code Code Available 2Robust One-step Speech Enhancement via Consistency Distillation Jul 8, 2025 Speech Enhancement
Code Code Available 1Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement May 26, 2025 Speech Enhancement
Code Code Available 1HiFi-Stream: Streaming Speech Enhancement with Generative Adversarial Networks Mar 21, 2025 Speech Enhancement
Code Code Available 1FNSE-SBGAN: Far-field Speech Enhancement with Schrodinger Bridge and Generative Adversarial Networks Mar 17, 2025 Speech Enhancement
Code Code Available 1PrimeK-Net: Multi-scale Spectral Learning via Group Prime-Kernel Convolutional Neural Networks for Single Channel Speech Enhancement Feb 27, 2025 Computational Efficiency Speech Enhancement
Code Code Available 1Adaptive Convolution for CNN-based Speech Enhancement Models Feb 20, 2025 Decoder Speech Enhancement
Code Code Available 1SEF-PNet: Speaker Encoder-Free Personalized Speech Enhancement with Local and Global Contexts Aggregation Jan 20, 2025 Speaker Verification Speech Enhancement
Code Code Available 1AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder Jan 9, 2025 Pitch Classification Pitch control
Code Code Available 1Source Separation & Automatic Transcription for Music Dec 9, 2024 Music Transcription Speech Enhancement
Code Code Available 1Explainable DNN-based Beamformer with Postfilter Nov 16, 2024 Speech Enhancement
Code Code Available 1A Lightweight and Real-Time Binaural Speech Enhancement Model with Spatial Cues Preservation Sep 19, 2024 Speech Enhancement
Code Code Available 1LSTMSE-Net: Long Short Term Speech Enhancement Network for Audio-visual Speech Enhancement Sep 3, 2024 Decoder Speech Enhancement
Code Code Available 1Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement Aug 30, 2024 Decoder Speech Enhancement
Code Code Available 1Vibravox: A Dataset of French Speech Captured with Body-conduction Audio Sensors Jul 16, 2024 Automatic Phoneme Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection from Speech in Real-World Operative Conditions Jun 23, 2024 Audio Classification Parkinson Detection from Speech
Code Code Available 1AV-CrossNet: an Audiovisual Complex Spectral Mapping Network for Speech Separation By Leveraging Narrow- and Cross-Band Modeling Jun 17, 2024 Speaker Separation Speech Enhancement
Code Code Available 1Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enhancement Jun 6, 2024 Diversity Speech Enhancement
Code Code Available 1Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment Jun 5, 2024 Attribute Speech Enhancement
Code Code Available 1A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition May 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Binaural Speech Enhancement Using Deep Complex Convolutional Transformer Networks Mar 8, 2024 Decoder Speech Enhancement
Code Code Available 1A Refining Underlying Information Framework for Monaural Speech Enhancement Dec 18, 2023 Speech Enhancement
Code Code Available 1Investigating the Design Space of Diffusion Models for Speech Enhancement Dec 7, 2023 Image Generation Speech Enhancement
Code Code Available 1D4AM: A General Denoising Framework for Downstream Acoustic Models Nov 28, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Unsupervised speech enhancement with diffusion-based generative models Sep 19, 2023 Speech Enhancement
Code Code Available 1Single and Few-step Diffusion for Generative Speech Enhancement Sep 18, 2023 Denoising Speech Enhancement
Code Code Available 1Multi-dimensional Speech Quality Assessment in Crowdsourcing Sep 14, 2023 Speech Enhancement
Code Code Available 1Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Probabilistic Models Sep 14, 2023 Speaker Verification Speech Enhancement
Code Code Available 1Simulating room transfer functions between transducers mounted on audio devices using a modified image source method Sep 7, 2023 Computational Efficiency parameter estimation
Code Code Available 1DeFTAN-II: Efficient Multichannel Speech Enhancement with Subgroup Processing Aug 30, 2023 Speech Enhancement
Code Code Available 1MetricGAN-OKD: Multi-Metric Optimization of MetricGAN via Online Knowledge Distillation for Speech Enhancement Jul 24, 2023 Knowledge Distillation Speech Enhancement
Code Code Available 1Noise-aware Speech Enhancement using Diffusion Probabilistic Model Jul 16, 2023 Denoising model
Code Code Available 1Disentanglement in a GAN for Unconditional Speech Synthesis Jul 4, 2023 Disentanglement Generative Adversarial Network
Code Code Available 1Variance-Preserving-Based Interpolation Diffusion Models for Speech Enhancement Jun 14, 2023 Speech Enhancement
Code Code Available 1A Mask Free Neural Network for Monaural Speech Enhancement Jun 7, 2023 Speech Enhancement
Code Code Available 1A Multi-dimensional Deep Structured State Space Approach to Speech Enhancement Using Small-footprint Models Jun 1, 2023 Data Augmentation Speech Enhancement
Code Code Available 1Diffusion-Based Mel-Spectrogram Enhancement for Personalized Speech Synthesis with Found Data May 18, 2023 Speech Enhancement Speech Synthesis
Code Code Available 1BASEN: Time-Domain Brain-Assisted Speech Enhancement Network with Convolutional Cross Attention in Multi-talker Conditions May 17, 2023 EEG Speech Enhancement
Code Code Available 1Integrating Uncertainty into Neural Network-based Speech Enhancement May 15, 2023 Speech Enhancement
Code Code Available 1Miipher: A Robust Speech Restoration Model Integrating Self-Supervised Speech and Text Representations Mar 3, 2023 Speech Denoising Speech Enhancement
Code Code Available 1Reducing the Prior Mismatch of Stochastic Differential Equations for Diffusion-based Speech Enhancement Feb 28, 2023 Speech Enhancement
Code Code Available 1Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition Feb 22, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation Feb 22, 2023 Multi-Task Learning Speech Enhancement
Code Code Available 1TAPLoss: A Temporal Acoustic Parameter Loss for Speech Enhancement Feb 16, 2023 Speaker Recognition Speech Enhancement
Code Code Available 1PAAPLoss: A Phonetic-Aligned Acoustic Parameter Loss for Speech Enhancement Feb 16, 2023 Speech Enhancement Time Series
Code Code Available 1DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement Dec 15, 2022 Denoising Speech Dereverberation
Code Code Available 1