Exploration of Adapter for Noise Robust Automatic Speech Recognition Feb 28, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Audio-Visual Speech Enhancement in Noisy Environments via Emotion-Based Contextual Cues Feb 26, 2024 Decoder Speech Enhancement
— Unverified 0Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech Feb 26, 2024 Quantization Speech Enhancement
Code Code Available 2SICRN: Advancing Speech Enhancement through State Space Model and Inplace Convolution Techniques Feb 22, 2024 Speech Enhancement
— Unverified 0Mel-FullSubNet: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR Feb 21, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Plugin Speech Enhancement: A Universal Speech Enhancement Framework Inspired by Dynamic Neural Network Feb 20, 2024 Data Augmentation Speech Enhancement
— Unverified 0SECP: A Speech Enhancement-Based Curation Pipeline For Scalable Acquisition Of Clean Speech Feb 19, 2024 Speech Enhancement
— Unverified 0Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model Feb 16, 2024 Denoising Speech Enhancement
— Unverified 0Diffusion Models for Audio Restoration Feb 15, 2024 Speech Enhancement
— Unverified 0Overview of the L3DAS23 Challenge on Audio-Visual Extended Reality Feb 14, 2024 Audio Signal Processing Sound Event Localization and Detection
— Unverified 0Unrestricted Global Phase Bias-Aware Single-channel Speech Enhancement with Conformer-based Metric GAN Feb 13, 2024 Speech Enhancement
— Unverified 0Array Geometry-Robust Attention-Based Neural Beamformer for Moving Speakers Feb 5, 2024 Speech Enhancement
— Unverified 0Objective and subjective evaluation of speech enhancement methods in the UDASE task of the 7th CHiME challenge Feb 2, 2024 Domain Adaptation Speech Enhancement
Code Code Available 0Real-time Stereo Speech Enhancement with Spatial-Cue Preservation based on Dual-Path Structure Feb 1, 2024 Speech Enhancement
— Unverified 0An Analysis of the Variance of Diffusion-based Speech Enhancement Feb 1, 2024 Speech Enhancement
— Unverified 0SpeechComposer: Unifying Multiple Speech Tasks with Prompt Composition Jan 31, 2024 Decoder Language Modeling
— Unverified 0Improving Design of Input Condition Invariant Speech Enhancement Jan 25, 2024 Speech Enhancement
Code Code Available 0A Two-Stage Framework in Cross-Spectrum Domain for Real-Time Speech Enhancement Jan 19, 2024 Speech Enhancement
— Unverified 0An Empirical Study on the Impact of Positional Encoding in Transformer-based Monaural Speech Enhancement Jan 18, 2024 POS Position
— Unverified 0On Speech Pre-emphasis as a Simple and Inexpensive Method to Boost Speech Enhancement Jan 17, 2024 Automatic Speech Recognition Speech Enhancement
— Unverified 0Noise-robust zero-shot text-to-speech synthesis conditioned on self-supervised speech-representation model with adapters Jan 10, 2024 Self-Supervised Learning Speech Enhancement
— Unverified 0FADI-AEC: Fast Score Based Diffusion Model Guided by Far-end Signal for Acoustic Echo Cancellation Jan 8, 2024 Acoustic echo cancellation Speech Enhancement
— Unverified 0A unified multichannel far-field speech recognition system: combining neural beamforming with attention based end-to-end model Jan 5, 2024 Speech Enhancement speech-recognition
— Unverified 0Single-channel speech enhancement using learnable loss mixup Dec 20, 2023 Speech Enhancement
— Unverified 0On real-time multi-stage speech enhancement systems Dec 19, 2023 Speech Enhancement
— Unverified 0A Refining Underlying Information Framework for Monaural Speech Enhancement Dec 18, 2023 Speech Enhancement
Code Code Available 1Attention-Driven Multichannel Speech Enhancement in Moving Sound Source Scenarios Dec 17, 2023 Speech Enhancement
— Unverified 0A Deep Representation Learning-based Speech Enhancement Method Using Complex Convolution Recurrent Variational Autoencoder Dec 15, 2023 Representation Learning Speech Enhancement
— Unverified 0SELM: Speech Enhancement Using Discrete Tokens and Language Models Dec 15, 2023 Self-Supervised Learning Speech Enhancement
— Unverified 0Ultra Low Complexity Deep Learning Based Noise Suppression Dec 13, 2023 Deep Learning Speech Enhancement
— Unverified 0ROSE: A Recognition-Oriented Speech Enhancement Framework in Air Traffic Control Using Multi-Objective Learning Dec 11, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Investigating the Design Space of Diffusion Models for Speech Enhancement Dec 7, 2023 Image Generation Speech Enhancement
Code Code Available 1Diffusion-Based Speech Enhancement in Matched and Mismatched Conditions Using a Heun-Based Sampler Dec 5, 2023 Image Generation Speech Enhancement
— Unverified 0SEFGAN: Harvesting the Power of Normalizing Flows and GANs for Efficient High-Quality Speech Enhancement Dec 4, 2023 Audio Generation Speech Enhancement
— Unverified 0Head Orientation Estimation with Distributed Microphones Using Speech Radiation Patterns Dec 4, 2023 Speech Enhancement
— Unverified 0Subspace Hybrid MVDR Beamforming for Augmented Hearing Nov 30, 2023 Computational Efficiency Speech Enhancement
— Unverified 0D4AM: A General Denoising Framework for Downstream Acoustic Models Nov 28, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1LC4SV: A Denoising Framework Learning to Compensate for Unseen Speaker Verification Models Nov 28, 2023 Denoising Speaker Verification
— Unverified 0CheapNET: Improving Light-weight speech enhancement network by projected loss function Nov 27, 2023 Speech Enhancement
— Unverified 0Cooperative Dual Attention for Audio-Visual Speech Enhancement with Facial Cues Nov 24, 2023 Speech Enhancement
— Unverified 0Sparsity-Driven EEG Channel Selection for Brain-Assisted Speech Enhancement Nov 22, 2023 channel selection EEG
— Unverified 0How does end-to-end speech recognition training impact speech enhancement artifacts? Nov 20, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SE Territory: Monaural Speech Enhancement Meets the Fixed Virtual Perceptual Space Mapping Nov 3, 2023 Multi-Task Learning Speech Enhancement
— Unverified 0Scenario-Aware Audio-Visual TF-GridNet for Target Speech Extraction Oct 30, 2023 Speaker Separation Speech Enhancement
— Unverified 0DPATD: Dual-Phase Audio Transformer for Denoising Oct 30, 2023 Denoising Speech Enhancement
— Unverified 0TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch Oct 27, 2023 Self-Supervised Learning Speech Enhancement
Code Code Available 4Single channel speech enhancement by colored spectrograms Oct 26, 2023 Denoising Generative Adversarial Network
— Unverified 0Generative Pre-training for Speech with Flow Matching Oct 25, 2023 Speech Enhancement Speech Synthesis
— Unverified 0LC-TTFS: Towards Lossless Network Conversion for Spiking Neural Networks with TTFS Coding Oct 23, 2023 Edge-computing image-classification
— Unverified 0Deep Beamforming for Speech Enhancement and Speaker Localization with an Array Response-Aware Loss Function Oct 19, 2023 Speech Enhancement
— Unverified 0