Linguistic Knowledge Transfer Learning for Speech Enhancement Mar 10, 2025 Speech Enhancement Transfer Learning
— Unverified 0ProSE: Diffusion Priors for Speech Enhancement Mar 9, 2025 Denoising regression
— Unverified 0UL-UNAS: Ultra-Lightweight U-Nets for Real-Time Speech Enhancement via Network Architecture Search Mar 1, 2025 Neural Architecture Search Speech Enhancement
Code Code Available 2LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement Mar 1, 2025 Language Modeling Language Modelling
Code Code Available 2CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR Feb 27, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 2PrimeK-Net: Multi-scale Spectral Learning via Group Prime-Kernel Convolutional Neural Networks for Single Channel Speech Enhancement Feb 27, 2025 Computational Efficiency Speech Enhancement
Code Code Available 1Enhancing Speech Quality through the Integration of BGRU and Transformer Architectures Feb 25, 2025 Speech Enhancement
— Unverified 0Speech Enhancement Using Continuous Embeddings of Neural Audio Codec Feb 22, 2025 Quantization Speech Enhancement
— Unverified 0Adaptive Convolution for CNN-based Speech Enhancement Models Feb 20, 2025 Decoder Speech Enhancement
Code Code Available 1LMFCA-Net: A Lightweight Model for Multi-Channel Speech Enhancement with Efficient Narrow-Band and Cross-Band Attention Feb 17, 2025 Speech Enhancement
— Unverified 0TAPS: Throat and Acoustic Paired Speech Dataset for Deep Learning-Based Speech Enhancement Feb 17, 2025 Speech Enhancement
— Unverified 0Microphone Array Geometry Independent Multi-Talker Distant ASR: NTT System for the DASR Task of the CHiME-8 Challenge Feb 14, 2025 Action Detection Activity Detection
— Unverified 0Advances in Microphone Array Processing and Multichannel Speech Enhancement Feb 13, 2025 Speech Enhancement
— Unverified 0GenSE: Generative Speech Enhancement via Language Models using Hierarchical Modeling Feb 5, 2025 Language Modeling Language Modelling
— Unverified 0Metis: A Foundation Speech Generation Model with Masked Generative Pre-training Feb 5, 2025 Self-Supervised Learning Speech Enhancement
Code Code Available 9Learning-based A Posteriori Speech Presence Probability Estimation and Applications Jan 23, 2025 Speech Enhancement speech-recognition
— Unverified 0Bridging The Multi-Modality Gaps of Audio, Visual and Linguistic for Speech Enhancement Jan 23, 2025 Audio Signal Processing Speech Enhancement
— Unverified 0Generative Data Augmentation Challenge: Zero-Shot Speech Synthesis for Personalized Speech Enhancement Jan 23, 2025 Data Augmentation Speech Enhancement
— Unverified 0UP-Cycle-SENet: Unpaired Phase-aware Speech Enhancement Using Deep Complex Cycle Adversarial Networks Jan 22, 2025 Speech Enhancement
— Unverified 0Let SSMs be ConvNets: State-space Modeling with Optimal Tensor Contractions Jan 22, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Speech Enhancement with Overlapped-Frame Information Fusion and Causal Self-Attention Jan 21, 2025 Speech Enhancement
Code Code Available 0SEF-PNet: Speaker Encoder-Free Personalized Speech Enhancement with Local and Global Contexts Aggregation Jan 20, 2025 Speaker Verification Speech Enhancement
Code Code Available 1DFingerNet: Noise-Adaptive Speech Enhancement for Hearing Aids Jan 17, 2025 Denoising Speech Enhancement
— Unverified 0Microphone Array Signal Processing and Deep Learning for Speech Enhancement Jan 13, 2025 Deep Learning Diversity
— Unverified 0Multi-modal Speech Enhancement with Limited Electromyography Channels Jan 11, 2025 Electromyography (EMG) Speech Enhancement
— Unverified 0Estimation and Restoration of Unknown Nonlinear Distortion using Diffusion Jan 10, 2025 Audio Effects Modeling Quantization
Code Code Available 0xLSTM-SENet: xLSTM for Single-Channel Speech Enhancement Jan 10, 2025 Mamba Speech Enhancement
Code Code Available 2AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder Jan 9, 2025 Pitch Classification Pitch control
Code Code Available 1FLowHigh: Towards Efficient and High-Quality Audio Super-Resolution with Single-Step Flow Matching Jan 9, 2025 Audio Super-Resolution Computational Efficiency
Code Code Available 2Artifact-free Sound Quality in DNN-based Closed-loop Systems for Audio Processing Jan 7, 2025 Speech Enhancement
— Unverified 0Causal Speech Enhancement with Predicting Semantics based on Quantized Self-supervised Learning Features Dec 26, 2024 Multi-Task Learning Quantization
— Unverified 0Neural Directed Speech Enhancement with Dual Microphone Array in High Noise Scenario Dec 24, 2024 Speech Enhancement
— Unverified 0From KAN to GR-KAN: Advancing Speech Enhancement with KAN-Based Methodology Dec 23, 2024 Kolmogorov-Arnold Networks Speech Enhancement
— Unverified 0Time-Graph Frequency Representation with Singular Value Decomposition for Neural Speech Enhancement Dec 22, 2024 Speech Enhancement
Code Code Available 0Scalable Speech Enhancement with Dynamic Channel Pruning Dec 22, 2024 Speech Enhancement
— Unverified 0Mamba-SEUNet: Mamba UNet for Monaural Speech Enhancement Dec 21, 2024 Mamba
Code Code Available 2Scale This, Not That: Investigating Key Dataset Attributes for Efficient Speech Enhancement Scaling Dec 19, 2024 Attribute Speech Enhancement
— Unverified 0Investigating the Effects of Diffusion-based Conditional Generative Speech Models Used for Speech Enhancement on Dysarthric Speech Dec 18, 2024 Speech Enhancement
— Unverified 0Evaluating the Impact of Discriminative and Generative E2E Speech Enhancement Models on Syllable Stress Preservation Dec 11, 2024 Speech Enhancement
— Unverified 0TouchTTS: An Embarrassingly Simple TTS Framework that Everyone Can Touch Dec 11, 2024 Denoising speaker-diarization
— Unverified 0Source Separation & Automatic Transcription for Music Dec 9, 2024 Music Transcription Speech Enhancement
Code Code Available 1SALMONN-omni: A Codec-free LLM for Full-duplex Speech Understanding and Generation Nov 27, 2024 Question Answering Speech Enhancement
— Unverified 0Towards Advanced Speech Signal Processing: A Statistical Perspective on Convolution-Based Architectures and its Applications Nov 20, 2024 Emotion Recognition Speaker Identification
— Unverified 0GhostRNN: Reducing State Redundancy in RNN with Cheap Operations Nov 20, 2024 Keyword Spotting Speech Enhancement
— Unverified 0A Neural Denoising Vocoder for Clean Waveform Generation from Noisy Mel-Spectrogram based on Amplitude and Phase Predictions Nov 19, 2024 Denoising Speech Enhancement
— Unverified 0Explainable DNN-based Beamformer with Postfilter Nov 16, 2024 Speech Enhancement
Code Code Available 1SAV-SE: Scene-aware Audio-Visual Speech Enhancement with Selective State Space Model Nov 12, 2024 Mamba Speech Enhancement
— Unverified 0DCF-DS: Deep Cascade Fusion of Diarization and Separation for Speech Recognition under Realistic Single-Channel Conditions Nov 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Selective State Space Model for Monaural Speech Enhancement Nov 9, 2024 Mamba Speech Enhancement
— Unverified 0Modulating State Space Model with SlowFast Framework for Compute-Efficient Ultra Low-Latency Speech Enhancement Nov 4, 2024 Speech Enhancement
— Unverified 0