AV-CrossNet: an Audiovisual Complex Spectral Mapping Network for Speech Separation By Leveraging Narrow- and Cross-Band Modeling Jun 17, 2024 Speaker Separation Speech Enhancement
Code Code Available 15 Binaural Speech Enhancement Using Deep Complex Convolutional Transformer Networks Mar 8, 2024 Decoder Speech Enhancement
Code Code Available 15 A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition May 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Explainable DNN-based Beamformer with Postfilter Nov 16, 2024 Speech Enhancement
Code Code Available 15 Exploring Deep Hybrid Tensor-to-Vector Network Architectures for Regression Based Speech Enhancement Jul 25, 2020 regression Speech Enhancement
Code Code Available 15 Adaptive Convolution for CNN-based Speech Enhancement Models Feb 20, 2025 Decoder Speech Enhancement
Code Code Available 15 A non-causal FFTNet architecture for speech enhancement Jun 8, 2020 Speech Enhancement
Code Code Available 15 FaSNet: Low-latency Adaptive Beamforming for Multi-microphone Audio Processing Sep 29, 2019 Speech Enhancement speech-recognition
Code Code Available 15 Simulating room transfer functions between transducers mounted on audio devices using a modified image source method Sep 7, 2023 Computational Efficiency parameter estimation
Code Code Available 15 SpeechLMScore: Evaluating speech generation using speech language model Dec 8, 2022 Language Modeling Language Modelling
Code Code Available 15 DNN-based mask estimation for distributed speech enhancement in spatially unconstrained microphone arrays Nov 3, 2020 Diversity Noise Estimation
Code Code Available 15 A Deep Learning Loss Function based on Auditory Power Compression for Speech Enhancement Aug 26, 2021 Speech Enhancement
Code Code Available 15 An Overview of Deep-Learning-Based Audio-Visual Speech Enhancement and Separation Aug 21, 2020 Deep Learning Speech Enhancement
Code Code Available 15 A light-weight full-band speech enhancement model Jun 29, 2022 Speech Enhancement
Code Code Available 15 A Perceptually-Motivated Approach for Low-Complexity, Real-Time Enhancement of Fullband Speech Aug 27, 2020 CPU Speech Enhancement
Code Code Available 15 AVA-ActiveSpeaker: An Audio-Visual Dataset for Active Speaker Detection Jan 5, 2019 Active Speaker Detection Audio-Visual Active Speaker Detection
Code Code Available 15 Self-Attention Generative Adversarial Network for Speech Enhancement Oct 18, 2020 Generative Adversarial Network Speech Enhancement
Code Code Available 15 Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition Feb 22, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Disentanglement in a GAN for Unconditional Speech Synthesis Jul 4, 2023 Disentanglement Generative Adversarial Network
Code Code Available 15 Group Communication with Context Codec for Lightweight Source Separation Dec 14, 2020 Decoder Speech Enhancement
Code Code Available 15 Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement Oct 13, 2021 Speech Enhancement
Code Code Available 15 HiFi-GAN: High-Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks Jun 10, 2020 Denoising Speech Dereverberation
Code Code Available 15 CDPAM: Contrastive learning for perceptual audio similarity Feb 9, 2021 Contrastive Learning Speech Enhancement
Code Code Available 15 A Refining Underlying Information Framework for Monaural Speech Enhancement Dec 18, 2023 Speech Enhancement
Code Code Available 15 Stabilizing Label Assignment for Speech Separation by Self-supervised Pre-training Oct 29, 2020 Speaker Separation Speech Enhancement
Code Code Available 15 Diffusion-based Generative Speech Source Separation Oct 31, 2022 Speech Enhancement
Code Code Available 15 Diffusion-Based Mel-Spectrogram Enhancement for Personalized Speech Synthesis with Found Data May 18, 2023 Speech Enhancement Speech Synthesis
Code Code Available 15 Improved Lite Audio-Visual Speech Enhancement Aug 30, 2020 Speech Enhancement
Code Code Available 15 Improving GANs for Speech Enhancement Jan 15, 2020 Speech Enhancement
Code Code Available 15 CITISEN: A Deep Learning-Based Speech Signal-Processing Mobile Application Aug 21, 2020 Acoustic Scene Classification Data Augmentation
Code Code Available 15 DeFTAN-II: Efficient Multichannel Speech Enhancement with Subgroup Processing Aug 30, 2023 Speech Enhancement
Code Code Available 15 Improving Speech Enhancement through Fine-Grained Speech Characteristics Jul 1, 2022 Deep Learning Speech Enhancement
Code Code Available 15 DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement Dec 15, 2022 Denoising Speech Dereverberation
Code Code Available 15 A Mask Free Neural Network for Monaural Speech Enhancement Jun 7, 2023 Speech Enhancement
Code Code Available 15 Integrating Uncertainty into Neural Network-based Speech Enhancement May 15, 2023 Speech Enhancement
Code Code Available 15 Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Probabilistic Models Sep 14, 2023 Speaker Verification Speech Enhancement
Code Code Available 15 Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition Mar 28, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Deep Residual-Dense Lattice Network for Speech Enhancement Feb 27, 2020 Speech Enhancement
Code Code Available 15 SceneFake: An Initial Dataset and Benchmarks for Scene Fake Audio Detection Nov 11, 2022 Speech Enhancement
Code Code Available 15 Investigating the Design Space of Diffusion Models for Speech Enhancement Dec 7, 2023 Image Generation Speech Enhancement
Code Code Available 15 Deep Learning-based Non-Intrusive Multi-Objective Speech Assessment Model with Cross-Domain Features Nov 3, 2021 Prediction Speech Enhancement
Code Code Available 15 SEANet: A Multi-modal Speech Enhancement Network Sep 4, 2020 Speech Enhancement
Code Code Available 15 dEchorate: a Calibrated Room Impulse Response Database for Echo-aware Signal Processing Apr 27, 2021 Benchmarking Retrieval
Code Code Available 15 D4AM: A General Denoising Framework for Downstream Acoustic Models Nov 28, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15 Continual self-training with bootstrapped remixing for speech enhancement Oct 19, 2021 Domain Adaptation Speech Enhancement
Code Code Available 15 Lite Audio-Visual Speech Enhancement May 24, 2020 Data Compression Denoising
Code Code Available 15 SEF-PNet: Speaker Encoder-Free Personalized Speech Enhancement with Local and Global Contexts Aggregation Jan 20, 2025 Speaker Verification Speech Enhancement
Code Code Available 15 Look\&Listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement Mar 4, 2022 Active Speaker Detection Multi-Task Learning
Code Code Available 15 Semi-Supervised Multichannel Speech Enhancement With a Deep Speech Prior Oct 7, 2019 Speech Enhancement
Code Code Available 15 SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing Oct 14, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 15