FLowHigh: Towards Efficient and High-Quality Audio Super-Resolution with Single-Step Flow Matching (Jan 9, 2025). Tags: Audio Super-Resolution, Computational Efficiency. Code available.
LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT (Oct 7, 2023). Tags: Audio captioning, Automatic Speech Recognition. Code available.
Mamba-SEUNet: Mamba UNet for Monaural Speech Enhancement (Dec 21, 2024). Tags: Mamba. Code available.
Real Time Speech Enhancement in the Waveform Domain (Jun 23, 2020). Tags: CPU, Data Augmentation. Code available.
UL-UNAS: Ultra-Lightweight U-Nets for Real-Time Speech Enhancement via Network Architecture Search (Mar 1, 2025). Tags: Neural Architecture Search, Speech Enhancement. Code available.
Integrating Uncertainty into Neural Network-based Speech Enhancement (May 15, 2023). Tags: Speech Enhancement. Code available.
FaSNet: Low-latency Adaptive Beamforming for Multi-microphone Audio Processing (Sep 29, 2019). Tags: Speech Enhancement, speech-recognition. Code available.
Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection from Speech in Real-World Operative Conditions (Jun 23, 2024). Tags: Audio Classification, Parkinson Detection from Speech. Code available.
Insights Into Deep Non-linear Filters for Improved Multi-channel Speech Enhancement (Jun 27, 2022). Tags: Speech Enhancement. Code available.
Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition (Oct 11, 2021). Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR). Code available.
Instantaneous PSD Estimation for Speech Enhancement based on Generalized Principal Components (Jul 1, 2020). Tags: Speech Enhancement. Code available.
Exploring Deep Hybrid Tensor-to-Vector Network Architectures for Regression Based Speech Enhancement (Jul 25, 2020). Tags: regression, Speech Enhancement. Code available.
AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder (Jan 9, 2025). Tags: Pitch Classification, Pitch control. Code available.
Explainable DNN-based Beamformer with Postfilter (Nov 16, 2024). Tags: Speech Enhancement. Code available.
Improving Speech Enhancement through Fine-Grained Speech Characteristics (Jul 1, 2022). Tags: Deep Learning, Speech Enhancement. Code available.
Inference and Denoise: Causal Inference-based Neural Speech Enhancement (Nov 2, 2022). Tags: Causal Inference, Speech Enhancement. Code available.
INTERSPEECH 2021 ConferencingSpeech Challenge: Towards Far-field Multi-Channel Speech Enhancement for Video Conferencing (Apr 2, 2021). Tags: Speech Enhancement, Task 2. Code available.
EasyCom: An Augmented Reality Dataset to Support Algorithms for Easy Communication in Noisy Environments (Jul 9, 2021). Tags: Speech Enhancement. Code available.
Improved Lite Audio-Visual Speech Enhancement (Aug 30, 2020). Tags: Speech Enhancement. Code available.
Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement (Oct 13, 2021). Tags: Speech Enhancement. Code available.
DNN-based mask estimation for distributed speech enhancement in spatially unconstrained microphone arrays (Nov 3, 2020). Tags: Diversity, Noise Estimation. Code available.
Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition (Mar 28, 2022). Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR). Code available.
Improving GANs for Speech Enhancement (Jan 15, 2020). Tags: Speech Enhancement. Code available.
A Multi-dimensional Deep Structured State Space Approach to Speech Enhancement Using Small-footprint Models (Jun 1, 2023). Tags: Data Augmentation, Speech Enhancement. Code available.
Disentanglement in a GAN for Unconditional Speech Synthesis (Jul 4, 2023). Tags: Disentanglement, Generative Adversarial Network. Code available.
Diffusion-based Generative Speech Source Separation (Oct 31, 2022). Tags: Speech Enhancement. Code available.
DeFTAN-II: Efficient Multichannel Speech Enhancement with Subgroup Processing (Aug 30, 2023). Tags: Speech Enhancement. Code available.
Diffusion-Based Mel-Spectrogram Enhancement for Personalized Speech Synthesis with Found Data (May 18, 2023). Tags: Speech Enhancement, Speech Synthesis. Code available.
Fast Multichannel Source Separation Based on Jointly Diagonalizable Spatial Covariance Matrices (Mar 8, 2019). Tags: Speech Enhancement. Code available.
Improving Perceptual Quality by Phone-Fortified Perceptual Loss using Wasserstein Distance for Speech Enhancement (Oct 28, 2020). Tags: Speech Enhancement. Code available.
Investigating the Design Space of Diffusion Models for Speech Enhancement (Dec 7, 2023). Tags: Image Generation, Speech Enhancement. Code available.
A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences (Jan 13, 2020). Tags: Denoising, Speech Enhancement. Code available.
A Mask Free Neural Network for Monaural Speech Enhancement (Jun 7, 2023). Tags: Speech Enhancement. Code available.
Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement (Aug 30, 2024). Tags: Decoder, Speech Enhancement. Code available.
D4AM: A General Denoising Framework for Downstream Acoustic Models (Nov 28, 2023). Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR). Code available.
HiFi-Stream: Streaming Speech Enhancement with Generative Adversarial Networks (Mar 21, 2025). Tags: Speech Enhancement. Code available.
Continual self-training with bootstrapped remixing for speech enhancement (Oct 19, 2021). Tags: Domain Adaptation, Speech Enhancement. Code available.
High Fidelity Speech Enhancement with Band-split RNN (Dec 1, 2022). Tags: Speech Enhancement, Vocal Bursts Intensity Prediction. Code available.
A Refining Underlying Information Framework for Monaural Speech Enhancement (Dec 18, 2023). Tags: Speech Enhancement. Code available.
dEchorate: a Calibrated Room Impulse Response Database for Echo-aware Signal Processing (Apr 27, 2021). Tags: Benchmarking, Retrieval. Code available.
Complex-valued Spatial Autoencoders for Multichannel Speech Enhancement (Aug 6, 2021). Tags: Speech Enhancement. Code available.
A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech (Aug 27, 2020). Tags: CPU, Speech Enhancement. Code available.
An Overview of Deep-Learning-Based Audio-Visual Speech Enhancement and Separation (Aug 21, 2020). Tags: Deep Learning, Speech Enhancement. Code available.
Deep Learning-based Non-Intrusive Multi-Objective Speech Assessment Model with Cross-Domain Features (Nov 3, 2021). Tags: Prediction, Speech Enhancement. Code available.
A Causal U-net based Neural Beamforming Network for Real-Time Multi-Channel Speech Enhancement (Aug 1, 2021). Tags: CPU, Speech Enhancement. Code available.
DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement (Dec 15, 2022). Tags: Denoising, Speech Dereverberation. Code available.
A Modulation-Domain Loss for Neural-Network-based Real-time Speech Enhancement (Feb 15, 2021). Tags: Speaker Identification, Speech Denoising. Code available.
Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Probabilistic Models (Sep 14, 2023). Tags: Speaker Verification, Speech Enhancement. Code available.
A light-weight full-band speech enhancement model (Jun 29, 2022). Tags: Speech Enhancement. Code available.
A Deep Learning Loss Function based on Auditory Power Compression for Speech Enhancement (Aug 26, 2021). Tags: Speech Enhancement. Code available.