Fast Real-time Personalized Speech Enhancement: End-to-End Enhancement Network (E3Net) and Knowledge Distillation | Apr 2, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
End-to-End Integration of Speech Recognition, Speech Enhancement, and Self-Supervised Learning Representation | Apr 1, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Multiple Confidence Gates For Joint Training Of SE And ASR | Apr 1, 2022 | Robust Speech Recognition, Speech Enhancement | Unverified | 0
Perceptual Contrast Stretching on Target Feature for Speech Enhancement | Mar 31, 2022 | Speech Enhancement | Code Available | 1
SpecGrad: Diffusion Probabilistic Model based Neural Vocoder with Adaptive Noise Spectral Shaping | Mar 31, 2022 | Denoising, Speech Enhancement | Unverified | 0
Speech Enhancement with Score-Based Generative Models in the Complex STFT Domain | Mar 31, 2022 | Speech Enhancement | Code Available | 1
Audio-Visual Speech Codecs: Rethinking Audio-Visual Speech Enhancement by Re-Synthesis | Mar 31, 2022 | Speech Enhancement | Code Available | 1
Phase-Aware Deep Speech Enhancement: It's All About The Frame Length | Mar 30, 2022 | All, Speech Enhancement | Unverified | 0
CMGAN: Conformer-based Metric GAN for Speech Enhancement | Mar 28, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 2
Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition | Mar 28, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1
Speech-enhanced and Noise-aware Networks for Robust Speech Recognition | Mar 25, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0
HiFi++: a Unified Framework for Bandwidth Extension and Speech Enhancement | Mar 24, 2022 | Audio Generation, Bandwidth Extension | Code Available | 1
MetricGAN+/-: Increasing Robustness of Noise Reduction on Unseen Data | Mar 23, 2022 | Speech Enhancement | Unverified | 0
FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement | Mar 23, 2022 | Speech Enhancement | Code Available | 2
Joint Noise Reduction and Listening Enhancement for Full-End Speech Enhancement | Mar 22, 2022 | Speech Enhancement | Unverified | 0
Investigating self-supervised learning for speech enhancement and separation | Mar 15, 2022 | Self-Supervised Learning, Speech Enhancement | Unverified | 0
FB-MSTCN: A Full-Band Single-Channel Speech Enhancement Method Based on Multi-Scale Temporal Convolutional Network | Mar 15, 2022 | Denoising, Speech Denoising | Unverified | 0
Exploiting Low-Rank Tensor-Train Deep Neural Networks Based on Riemannian Gradient Descent With Illustrations of Speech Processing | Mar 11, 2022 | Speech Enhancement, Spoken Command Recognition | Code Available | 0
PercepNet+: A Phase and SNR Aware PercepNet for Real-Time Speech Enhancement | Mar 4, 2022 | Speech Enhancement | Unverified | 0
MANNER: Multi-view Attention Network for Noise Erasure | Mar 4, 2022 | Decoder, Speech Enhancement | Code Available | 1
Integrating Statistical Uncertainty into Neural Network-Based Speech Enhancement | Mar 4, 2022 | Speech Enhancement | Unverified | 0
Look&Listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement | Mar 4, 2022 | Active Speaker Detection, Multi-Task Learning | Code Available | 1
ICASSP 2022 Acoustic Echo Cancellation Challenge | Feb 27, 2022 | Acoustic echo cancellation, Speech Enhancement | Code Available | 2
Towards Low-distortion Multi-channel Speech Enhancement: The ESPNet-SE Submission to The L3DAS22 Challenge | Feb 24, 2022 | Speech Enhancement | Unverified | 0
Phase Continuity: Learning Derivatives of Phase Spectrum for Speech Enhancement | Feb 24, 2022 | Speech Enhancement | Unverified | 0
The PCG-AIID System for L3DAS22 Challenge: MIMO and MISO convolutional recurrent Network for Multi Channel Speech Enhancement and Speech Recognition | Feb 21, 2022 | Denoising, Speech Denoising | Unverified | 0
L3DAS22 Challenge: Learning 3D Audio Sources in a Real Office Environment | Feb 21, 2022 | Sound Event Localization and Detection, Speech Enhancement | Code Available | 1
RemixIT: Continual self-training of speech enhancement models via bootstrapped remixing | Feb 17, 2022 | Domain Adaptation, Speech Enhancement | Code Available | 1
Speech Denoising in the Waveform Domain with Self-Attention | Feb 15, 2022 | Decoder, Denoising | Code Available | 2
EMGSE: Acoustic/EMG Fusion for Multimodal Speech Enhancement | Feb 14, 2022 | Electromyography (EMG), Speech Enhancement | Unverified | 0
Low-latency Monaural Speech Enhancement with Deep Filter-bank Equalizer | Feb 14, 2022 | Deep Learning, Speech Enhancement | Unverified | 0
A Novel Speech Intelligibility Enhancement Model based on Canonical Correlation and Deep Learning | Feb 11, 2022 | Speech Enhancement | Unverified | 0
Conditional Diffusion Probabilistic Model for Speech Enhancement | Feb 10, 2022 | model, Speech Enhancement | Code Available | 2
Royalflush Speaker Diarization System for ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge | Feb 10, 2022 | speaker-diarization, Speaker Diarization | Unverified | 0
Multimodal Audio-Visual Information Fusion using Canonical-Correlated Graph Neural Network for Energy-Efficient Speech Enhancement | Feb 9, 2022 | Graph Neural Network, Representation Learning | Unverified | 0
A Speech Intelligibility Enhancement Model based on Canonical Correlation and Deep Learning for Hearing-Assistive Technologies | Feb 8, 2022 | Speech Enhancement | Unverified | 0
Exploring Self-Attention Mechanisms for Speech Separation | Feb 6, 2022 | Denoising, Speech Enhancement | Unverified | 0
Optimization of a Real-Time Wavelet-Based Algorithm for Improving Speech Intelligibility | Feb 5, 2022 | Speech Enhancement, Speech-to-Text | Unverified | 0
The RoyalFlush System of Speech Recognition for M2MeT Challenge | Feb 3, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Joint Speech Recognition and Audio Captioning | Feb 3, 2022 | AudioCaps, Audio captioning | Unverified | 0
The impact of removing head movements on audio-visual speech enhancement | Feb 1, 2022 | Speech Enhancement | Unverified | 0
HGCN: Harmonic gated compensation network for speech enhancement | Jan 30, 2022 | Action Detection, Activity Detection | Code Available | 1
A two-step backward compatible fullband speech enhancement system | Jan 26, 2022 | Speech Enhancement, Vocal Bursts Valence Prediction | Unverified | 0
A Bayesian Permutation training deep representation learning method for speech enhancement with variational autoencoder | Jan 24, 2022 | Representation Learning, Speech Enhancement | Unverified | 0
End-to-End Neural Speech Coding for Real-Time Communications | Jan 24, 2022 | Decoder, Packet Loss Concealment | Unverified | 0
How Bad Are Artifacts?: Analyzing the Impact of Speech Enhancement Errors on ASR | Jan 18, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Learning to Enhance or Not: Neural Network-Based Switching of Enhanced and Observed Signals for Overlapping Speech Recognition | Jan 11, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
TFCN: Temporal-Frequential Convolutional Network for Single-Channel Speech Enhancement | Jan 3, 2022 | Speech Enhancement | Unverified | 0
Signal-Aware Direction-of-Arrival Estimation Using Attention Mechanisms | Jan 3, 2022 | Direction of Arrival Estimation, Speech Enhancement | Unverified | 0
Towards Robust Real-time Audio-Visual Speech Enhancement | Dec 16, 2021 | Speech Enhancement | Unverified | 0