A Multi-dimensional Deep Structured State Space Approach to Speech Enhancement Using Small-footprint Models Jun 1, 2023 Data Augmentation Speech Enhancement
Code Code Available 15 Group Communication with Context Codec for Lightweight Source Separation Dec 14, 2020 Decoder Speech Enhancement
Code Code Available 15 A Study on Speech Enhancement Based on Diffusion Probabilistic Model Jul 25, 2021 Speech Enhancement
Code Code Available 15 HiFi++: a Unified Framework for Bandwidth Extension and Speech Enhancement Mar 24, 2022 Audio Generation Bandwidth Extension
Code Code Available 15 HiFi-Stream: Streaming Speech Enhancement with Generative Adversarial Networks Mar 21, 2025 Speech Enhancement
Code Code Available 15 A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement Jun 22, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement Aug 30, 2024 Decoder Speech Enhancement
Code Code Available 15 Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement Oct 13, 2021 Speech Enhancement
Code Code Available 15 Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition Mar 28, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 DNN-based mask estimation for distributed speech enhancement in spatially unconstrained microphone arrays Nov 3, 2020 Diversity Noise Estimation
Code Code Available 15 Attention-based distributed speech enhancement for unconstrained microphone arrays with varying number of nodes Jun 15, 2021 Speech Enhancement
Code Code Available 15 Improving Speech Enhancement through Fine-Grained Speech Characteristics Jul 1, 2022 Deep Learning Speech Enhancement
Code Code Available 15 A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech Aug 27, 2020 CPU Speech Enhancement
Code Code Available 15 Improving GANs for Speech Enhancement Jan 15, 2020 Speech Enhancement
Code Code Available 15 An Overview of Deep-Learning-Based Audio-Visual Speech Enhancement and Separation Aug 21, 2020 Deep Learning Speech Enhancement
Code Code Available 15 Inference and Denoise: Causal Inference-based Neural Speech Enhancement Nov 2, 2022 Causal Inference Speech Enhancement
Code Code Available 15 AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder Jan 9, 2025 Pitch Classification Pitch control
Code Code Available 15 Instantaneous PSD Estimation for Speech Enhancement based on Generalized Principal Components Jul 1, 2020 Speech Enhancement
Code Code Available 15 A light-weight full-band speech enhancement model Jun 29, 2022 Speech Enhancement
Code Code Available 15 A Deep Learning Loss Function based on Auditory Power Compression for Speech Enhancement Aug 26, 2021 Speech Enhancement
Code Code Available 15 L3DAS21 Challenge: Machine Learning for 3D Audio Signal Processing Apr 12, 2021 Audio Signal Processing BIG-bench Machine Learning
Code Code Available 15 iSEGAN: Improved Speech Enhancement Generative Adversarial Networks Feb 20, 2020 Speech Enhancement
Code Code Available 15 A Lightweight and Real-Time Binaural Speech Enhancement Model with Spatial Cues Preservation Sep 19, 2024 Speech Enhancement
Code Code Available 15 Audio-Visual Speech Codecs: Rethinking Audio-Visual Speech Enhancement by Re-Synthesis Mar 31, 2022 Speech Enhancement
Code Code Available 15 Disentanglement in a GAN for Unconditional Speech Synthesis Jul 4, 2023 Disentanglement Generative Adversarial Network
Code Code Available 15 Learning Audio-Visual Dereverberation Jun 14, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 EasyCom: An Augmented Reality Dataset to Support Algorithms for Easy Communication in Noisy Environments Jul 9, 2021 Speech Enhancement
Code Code Available 15 Multi-Task Audio Source Separation Jul 14, 2021 Audio Source Separation Multi-task Audio Source Seperation
Code Code Available 15 A non-causal FFTNet architecture for speech enhancement Jun 8, 2020 Speech Enhancement
Code Code Available 15 Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Probabilistic Models Sep 14, 2023 Speaker Verification Speech Enhancement
Code Code Available 15 Diffusion-based Generative Speech Source Separation Oct 31, 2022 Speech Enhancement
Code Code Available 15 DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement Dec 15, 2022 Denoising Speech Dereverberation
Code Code Available 15 DeFTAN-II: Efficient Multichannel Speech Enhancement with Subgroup Processing Aug 30, 2023 Speech Enhancement
Code Code Available 15 Diffusion-Based Mel-Spectrogram Enhancement for Personalized Speech Synthesis with Found Data May 18, 2023 Speech Enhancement Speech Synthesis
Code Code Available 15 MetricGAN: Generative Adversarial Networks based Black-box Metric Scores Optimization for Speech Enhancement May 13, 2019 Generative Adversarial Network Speech Enhancement
Code Code Available 15 MeshRIR: A Dataset of Room Impulse Responses on Meshed Grid Points For Evaluating Sound Field Analysis and Synthesis Methods Jun 21, 2021 Distant Speech Recognition Room Impulse Response (RIR)
Code Code Available 15 BASPRO: a balanced script producer for speech corpus collection based on the genetic algorithm Dec 11, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 BASEN: Time-Domain Brain-Assisted Speech Enhancement Network with Convolutional Cross Attention in Multi-talker Conditions May 17, 2023 EEG Speech Enhancement
Code Code Available 15 Diffiner: A Versatile Diffusion-based Generative Refiner for Speech Enhancement Oct 27, 2022 Denoising Speech Enhancement
Code Code Available 15 AV-CrossNet: an Audiovisual Complex Spectral Mapping Network for Speech Separation By Leveraging Narrow- and Cross-Band Modeling Jun 17, 2024 Speaker Separation Speech Enhancement
Code Code Available 15 A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition May 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Adaptive Convolution for CNN-based Speech Enhancement Models Feb 20, 2025 Decoder Speech Enhancement
Code Code Available 15 MetricGAN+: An Improved Version of MetricGAN for Speech Enhancement Apr 8, 2021 Speech Enhancement
Code Code Available 15 MetricGAN-OKD: Multi-Metric Optimization of MetricGAN via Online Knowledge Distillation for Speech Enhancement Jul 24, 2023 Knowledge Distillation Speech Enhancement
Code Code Available 15 AVA-ActiveSpeaker: An Audio-Visual Dataset for Active Speaker Detection Jan 5, 2019 Active Speaker Detection Audio-Visual Active Speaker Detection
Code Code Available 15 An Investigation of End-to-End Models for Robust Speech Recognition Feb 11, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 dEchorate: a Calibrated Room Impulse Response Database for Echo-aware Signal Processing Apr 27, 2021 Benchmarking Retrieval
Code Code Available 15 MANNER: Multi-view Attention Network for Noise Erasure Mar 4, 2022 Decoder Speech Enhancement
Code Code Available 15 Deep Residual-Dense Lattice Network for Speech Enhancement Feb 27, 2020 Speech Enhancement
Code Code Available 15 Deep Learning-based Non-Intrusive Multi-Objective Speech Assessment Model with Cross-Domain Features Nov 3, 2021 Prediction Speech Enhancement
Code Code Available 15