ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding Jul 19, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Improving spatial cues for hearables using a parameterized binaural CDR estimator Jul 17, 2022 Speech Enhancement
— Unverified 0Multi-channel target speech enhancement based on ERB-scaled spatial coherence features Jul 17, 2022 Speech Enhancement
— Unverified 0Direction-Aware Adaptive Online Neural Speech Enhancement with an Augmented Reality Headset in Real Noisy Conversational Environments Jul 15, 2022 blind source separation Speech Enhancement
Code Code Available 2Direction-Aware Joint Adaptation of Neural Speech Enhancement and Recognition in Real Multiparty Conversational Environments Jul 15, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Improving Speech Enhancement through Fine-Grained Speech Characteristics Jul 1, 2022 Deep Learning Speech Enhancement
Code Code Available 1Improving Visual Speech Enhancement Network by Learning Audio-visual Affinity with Multi-head Attention Jun 30, 2022 Decoder Speech Enhancement
— Unverified 0GLD-Net: Improving Monaural Speech Enhancement by Learning Global and Local Dependency Features with GLD Block Jun 30, 2022 Decoder Speech Enhancement
— Unverified 0A light-weight full-band speech enhancement model Jun 29, 2022 Speech Enhancement
Code Code Available 1Challenges and Opportunities in Multi-device Speech Processing Jun 27, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Insights Into Deep Non-linear Filters for Improved Multi-channel Speech Enhancement Jun 27, 2022 Speech Enhancement
Code Code Available 1SAQAM: Spatial Audio Quality Assessment Metric Jun 24, 2022 Audio Quality Assessment Multi-Task Learning
— Unverified 0Efficient Transformer-based Speech Enhancement Using Long Frames and STFT Magnitudes Jun 23, 2022 Speech Enhancement Speech Separation
— Unverified 0A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement Jun 22, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1On the Role of Spatial, Spectral, and Temporal Processing for DNN-based Non-linear Multi-channel Speech Enhancement Jun 22, 2022 Speech Enhancement Speech Extraction
Code Code Available 1Multi-channel end-to-end neural network for speech enhancement, source localization, and voice activity detection Jun 20, 2022 Action Detection Activity Detection
— Unverified 00/1 Deep Neural Networks via Block Coordinate Descent Jun 19, 2022 10-shot image generation
— Unverified 0NASTAR: Noise Adaptive Speech Enhancement with Target-Conditional Resampling Jun 18, 2022 Retrieval Speech Enhancement
— Unverified 0Adversarial Privacy Protection on Speech Enhancement Jun 16, 2022 Speech Enhancement
Code Code Available 0To Dereverb Or Not to Dereverb? Perceptual Studies On Real-Time Dereverberation Targets Jun 16, 2022 Denoising Speech Enhancement
— Unverified 0EPG2S: Speech Generation and Speech Enhancement based on Electropalatography and Audio Signals using Multimodal Learning Jun 16, 2022 Speech Enhancement
— Unverified 0Universal Speech Enhancement with Score-based Diffusion Jun 7, 2022 Speech Enhancement
Code Code Available 1Canonical Cortical Graph Neural Networks and its Application for Speech Enhancement in Audio-Visual Hearing Aids Jun 6, 2022 BIG-bench Machine Learning Speech Enhancement
— Unverified 0Far-Field Speaker Recognition Benchmark Derived From The DiPCo Corpus Jun 1, 2022 Denoising Speaker Recognition
— Unverified 0Joint Training of Speech Enhancement and Self-supervised Model for Noise-robust ASR May 26, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0NeuralEcho: A Self-Attentive Recurrent Neural Network For Unified Acoustic Echo Suppression And Speech Enhancement May 20, 2022 Acoustic echo cancellation Speech Enhancement
— Unverified 0U-Former: Improving Monaural Speech Enhancement with Multi-head Self and Cross Attention May 18, 2022 Decoder Speech Enhancement
Code Code Available 1Dictionary-Based Fusion of Contact and Acoustic Microphones for Wind Noise Reduction May 18, 2022 Speech Enhancement
— Unverified 0Streaming Noise Context Aware Enhancement For Automatic Speech Recognition in Multi-Talker Environments May 17, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Task splitting for DNN-based acoustic echo and noise removal May 13, 2022 Acoustic echo cancellation Speech Enhancement
— Unverified 0DeepFilterNet2: Towards Real-Time Speech Enhancement on Embedded Devices for Full-Band Audio May 11, 2022 CPU Data Augmentation
Code Code Available 4A deep representation learning speech enhancement method using β-VAE May 11, 2022 Disentanglement Representation Learning
— Unverified 0Real-Time Packet Loss Concealment With Mixed Generative and Predictive Model May 11, 2022 Packet Loss Concealment Speech Enhancement
Code Code Available 3Generalized Fast Multichannel Nonnegative Matrix Factorization Based on Gaussian Scale Mixtures for Blind Source Separation May 11, 2022 blind source separation Speech Enhancement
— Unverified 0Speaker Reinforcement Using Target Source Extraction for Robust Automatic Speech Recognition May 9, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Acoustic echo suppression using a learning-based multi-frame minimum variance distortionless response filter May 7, 2022 parameter estimation Speech Enhancement
— Unverified 0On monoaural speech enhancement for automatic recognition of real noisy speech using mixture invariant training May 3, 2022 Robust Speech Recognition Speech Enhancement
— Unverified 0Improving Dual-Microphone Speech Enhancement by Learning Cross-Channel Features with Multi-Head Attention May 3, 2022 Decoder Multi-Task Learning
— Unverified 0A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network May 2, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Improved far-field speech recognition using Joint Variational Autoencoder Apr 24, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0RadioSES: mmWave-Based Audioradio Speech Enhancement and Separation System Apr 14, 2022 Speech Enhancement Speech Separation
— Unverified 0Receptive Field Analysis of Temporal Convolutional Networks for Monaural Speech Dereverberation Apr 13, 2022 Speech Dereverberation Speech Enhancement
Code Code Available 0VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration Apr 12, 2022 Speech Denoising Speech Enhancement
Code Code Available 3Listen only to me! How well can target speech extraction handle false alarms? Apr 11, 2022 Speaker Identification Speaker Verification
— Unverified 0Exploiting Hidden Representations from a DNN-based Speech Recogniser for Speech Intelligibility Prediction in Hearing-impaired Listeners Apr 8, 2022 Prediction Speech Enhancement
Code Code Available 0Boosting Self-Supervised Embeddings for Speech Enhancement Apr 7, 2022 Self-Supervised Learning Speech Enhancement
Code Code Available 1FFC-SE: Fast Fourier Convolution for Speech Enhancement Apr 6, 2022 Speech Enhancement
— Unverified 0Expression-preserving face frontalization improves visually assisted speech processing Apr 6, 2022 Face Model Lip Reading
— Unverified 0Audio-visual multi-channel speech separation, dereverberation and recognition Apr 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Complex Recurrent Variational Autoencoder with Application to Speech Enhancement Apr 5, 2022 Speech Enhancement
Code Code Available 0