A Multi-dimensional Deep Structured State Space Approach to Speech Enhancement Using Small-footprint Models Jun 1, 2023 Data Augmentation Speech Enhancement
Code Code Available 1High Fidelity Speech Enhancement with Band-split RNN Dec 1, 2022 Speech Enhancement Vocal Bursts Intensity Prediction
Code Code Available 1Inference and Denoise: Causal Inference-based Neural Speech Enhancement Nov 2, 2022 Causal Inference Speech Enhancement
Code Code Available 1A Refining Underlying Information Framework for Monaural Speech Enhancement Dec 18, 2023 Speech Enhancement
Code Code Available 1Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition Feb 22, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement Jun 22, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Group Communication with Context Codec for Lightweight Source Separation Dec 14, 2020 Decoder Speech Enhancement
Code Code Available 1MANNER: Multi-view Attention Network for Noise Erasure Mar 4, 2022 Decoder Speech Enhancement
Code Code Available 1HGCN: Harmonic gated compensation network for speech enhancement Jan 30, 2022 Action Detection Activity Detection
Code Code Available 1MeshRIR: A Dataset of Room Impulse Responses on Meshed Grid Points For Evaluating Sound Field Analysis and Synthesis Methods Jun 21, 2021 Distant Speech Recognition Room Impulse Response (RIR)
Code Code Available 1A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech Aug 27, 2020 CPU Speech Enhancement
Code Code Available 1MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only on noisy/ reverberated speech Oct 12, 2021 Speech Enhancement
Code Code Available 1An Overview of Deep-Learning-Based Audio-Visual Speech Enhancement and Separation Aug 21, 2020 Deep Learning Speech Enhancement
Code Code Available 1MMS-MSG: A Multi-purpose Multi-Speaker Mixture Signal Generator Sep 23, 2022 Speech Enhancement
Code Code Available 1Multi-dimensional Speech Quality Assessment in Crowdsourcing Sep 14, 2023 Speech Enhancement
Code Code Available 1MultiSV: Dataset for Far-Field Multi-Channel Speaker Verification Nov 11, 2021 Denoising Speaker Verification
Code Code Available 1AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder Jan 9, 2025 Pitch Classification Pitch control
Code Code Available 1Noise-aware Speech Enhancement using Diffusion Probabilistic Model Jul 16, 2023 Denoising model
Code Code Available 1A light-weight full-band speech enhancement model Jun 29, 2022 Speech Enhancement
Code Code Available 1Perceptual Contrast Stretching on Target Feature for Speech Enhancement Mar 31, 2022 Speech Enhancement
Code Code Available 1Phase-aware Speech Enhancement with Deep Complex U-Net Mar 7, 2019 Speech Enhancement valid
Code Code Available 1Phonetic Feedback for Speech Enhancement With and Without Parallel Speech Data Mar 3, 2020 Speech Enhancement
Code Code Available 1A Deep Learning Loss Function based on Auditory Power Compression for Speech Enhancement Aug 26, 2021 Speech Enhancement
Code Code Available 1Audio-Visual Speech Codecs: Rethinking Audio-Visual Speech Enhancement by Re-Synthesis Mar 31, 2022 Speech Enhancement
Code Code Available 1A Lightweight and Real-Time Binaural Speech Enhancement Model with Spatial Cues Preservation Sep 19, 2024 Speech Enhancement
Code Code Available 1Reducing the Prior Mismatch of Stochastic Differential Equations for Diffusion-based Speech Enhancement Feb 28, 2023 Speech Enhancement
Code Code Available 1FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement Oct 29, 2020 Speech Enhancement
Code Code Available 1HiFi++: a Unified Framework for Bandwidth Extension and Speech Enhancement Mar 24, 2022 Audio Generation Bandwidth Extension
Code Code Available 1Insights Into Deep Non-linear Filters for Improved Multi-channel Speech Enhancement Jun 27, 2022 Speech Enhancement
Code Code Available 1A non-causal FFTNet architecture for speech enhancement Jun 8, 2020 Speech Enhancement
Code Code Available 1Fast Multichannel Source Separation Based on Jointly Diagonalizable Spatial Covariance Matrices Mar 8, 2019 Speech Enhancement
Code Code Available 1Exploring Deep Hybrid Tensor-to-Vector Network Architectures for Regression Based Speech Enhancement Jul 25, 2020 regression Speech Enhancement
Code Code Available 1Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection from Speech in Real-World Operative Conditions Jun 23, 2024 Audio Classification Parkinson Detection from Speech
Code Code Available 1FaSNet: Low-latency Adaptive Beamforming for Multi-microphone Audio Processing Sep 29, 2019 Speech Enhancement speech-recognition
Code Code Available 1EasyCom: An Augmented Reality Dataset to Support Algorithms for Easy Communication in Noisy Environments Jul 9, 2021 Speech Enhancement
Code Code Available 1BASEN: Time-Domain Brain-Assisted Speech Enhancement Network with Convolutional Cross Attention in Multi-talker Conditions May 17, 2023 EEG Speech Enhancement
Code Code Available 1Explainable DNN-based Beamformer with Postfilter Nov 16, 2024 Speech Enhancement
Code Code Available 1Binaural Speech Enhancement Using Deep Complex Convolutional Transformer Networks Mar 8, 2024 Decoder Speech Enhancement
Code Code Available 1AV-CrossNet: an Audiovisual Complex Spectral Mapping Network for Speech Separation By Leveraging Narrow- and Cross-Band Modeling Jun 17, 2024 Speaker Separation Speech Enhancement
Code Code Available 1A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition May 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Adaptive Convolution for CNN-based Speech Enhancement Models Feb 20, 2025 Decoder Speech Enhancement
Code Code Available 1AVA-ActiveSpeaker: An Audio-Visual Dataset for Active Speaker Detection Jan 5, 2019 Active Speaker Detection Audio-Visual Active Speaker Detection
Code Code Available 1Disentanglement in a GAN for Unconditional Speech Synthesis Jul 4, 2023 Disentanglement Generative Adversarial Network
Code Code Available 1Diffusion-based Generative Speech Source Separation Oct 31, 2022 Speech Enhancement
Code Code Available 1Diffiner: A Versatile Diffusion-based Generative Refiner for Speech Enhancement Oct 27, 2022 Denoising Speech Enhancement
Code Code Available 1An Investigation of End-to-End Models for Robust Speech Recognition Feb 11, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Probabilistic Models Sep 14, 2023 Speaker Verification Speech Enhancement
Code Code Available 1BASPRO: a balanced script producer for speech corpus collection based on the genetic algorithm Dec 11, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Diffusion-Based Mel-Spectrogram Enhancement for Personalized Speech Synthesis with Found Data May 18, 2023 Speech Enhancement Speech Synthesis
Code Code Available 1DNN-based mask estimation for distributed speech enhancement in spatially unconstrained microphone arrays Nov 3, 2020 Diversity Noise Estimation
Code Code Available 1