Metis: A Foundation Speech Generation Model with Masked Generative Pre-training Feb 5, 2025 Self-Supervised Learning Speech Enhancement
Code Code Available 95 Hybrid Transformers for Music Source Separation Nov 15, 2022 Music Source Separation Speech Enhancement
Code Code Available 55 DeepFilterNet2: Towards Real-Time Speech Enhancement on Embedded Devices for Full-Band Audio May 11, 2022 CPU Data Augmentation
Code Code Available 45 TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch Oct 27, 2023 Self-Supervised Learning Speech Enhancement
Code Code Available 45 Deep Multi-Frame Filtering for Hearing Aids May 14, 2023 Speech Enhancement
Code Code Available 45 DeepFilterNet: Perceptually Motivated Real-Time Speech Enhancement May 14, 2023 CPU Speech Enhancement
Code Code Available 45 Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Sep 20, 2018 Multi-task Audio Source Seperation Music Source Separation
Code Code Available 35 An Investigation of Incorporating Mamba for Speech Enhancement May 10, 2024 Mamba Speech Enhancement
Code Code Available 35 Separate Anything You Describe Aug 9, 2023 Audio Source Separation Natural Language Queries
Code Code Available 35 Real-Time Packet Loss Concealment With Mixed Generative and Predictive Model May 11, 2022 Packet Loss Concealment Speech Enhancement
Code Code Available 35 Apollo: Band-sequence Modeling for High-Quality Audio Restoration Sep 13, 2024 Computational Efficiency Speech Enhancement
Code Code Available 35 SoundStream: An End-to-End Neural Audio Codec Jul 7, 2021 CPU Decoder
Code Code Available 35 EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation Jun 10, 2024 Speech Enhancement
Code Code Available 35 VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration Apr 12, 2022 Speech Denoising Speech Enhancement
Code Code Available 35 SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios Oct 2, 2024 Speech Enhancement Speech Separation
Code Code Available 35 TF-Locoformer: Transformer with Local Modeling by Convolution for Speech Separation and Enhancement Aug 6, 2024 Speech Enhancement Speech Separation
Code Code Available 25 DeepFilterNet: A Low Complexity Speech Enhancement Framework for Full-Band Audio based on Deep Filtering Oct 11, 2021 Speech Enhancement
Code Code Available 25 Towards Ultra-Low-Power Neuromorphic Speech Enhancement with Spiking-FullSubNet Oct 7, 2024 Denoising Speech Denoising
Code Code Available 25 Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech Feb 26, 2024 Quantization Speech Enhancement
Code Code Available 25 Speech Denoising in the Waveform Domain with Self-Attention Feb 15, 2022 Decoder Denoising
Code Code Available 25 Training-Free Multi-Step Audio Source Separation May 26, 2025 Audio Source Separation Denoising
Code Code Available 25 Mamba-SEUNet: Mamba UNet for Monaural Speech Enhancement Dec 21, 2024 Mamba
Code Code Available 25 Mamba in Speech: Towards an Alternative to Self-Attention May 21, 2024 Mamba Speech Enhancement
Code Code Available 25 SEGAN: Speech Enhancement Generative Adversarial Network Mar 28, 2017 Generative Adversarial Network Speech Enhancement
Code Code Available 25 Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses Feb 3, 2021 Decoder Speech Denoising
Code Code Available 25 LiSenNet: Lightweight Sub-band and Dual-Path Modeling for Real-Time Speech Enhancement Sep 20, 2024 Speech Enhancement
Code Code Available 25 IndicVoices-R: Unlocking a Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS Sep 9, 2024 Denoising Speech Enhancement
Code Code Available 25 StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation Dec 22, 2022 Speech Dereverberation Speech Enhancement
Code Code Available 25 LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement Mar 1, 2025 Language Modeling Language Modelling
Code Code Available 25 MP-SENet: A Speech Enhancement Model with Parallel Denoising of Magnitude and Phase Spectra May 23, 2023 Decoder Denoising
Code Code Available 25 FSPEN: AN ULTRA-LIGHTWEIGHT NETWORK FOR REAL TIME SPEECH ENAHNCMENT Apr 15, 2024 Speech Enhancement
Code Code Available 25 CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement Sep 22, 2022 Audio Super-Resolution Automatic Speech Recognition
Code Code Available 25 FlowSE: Efficient and High-Quality Speech Enhancement via Flow Matching May 26, 2025 Quantization Speech Enhancement
Code Code Available 25 FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement Mar 23, 2022 Speech Enhancement
Code Code Available 25 Efficient Speech Enhancement via Embeddings from Pre-trained Generative Audioencoders Jun 13, 2025 Speech Enhancement
Code Code Available 25 FLowHigh: Towards Efficient and High-Quality Audio Super-Resolution with Single-Step Flow Matching Jan 9, 2025 Audio Super-Resolution Computational Efficiency
Code Code Available 25 Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement Aug 17, 2023 Bandwidth Extension Decoder
Code Code Available 25 CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR Feb 27, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 25 ICASSP 2022 Acoustic Echo Cancellation Challenge Feb 27, 2022 Acoustic echo cancellation Speech Enhancement
Code Code Available 25 ICASSP 2023 Acoustic Echo Cancellation Challenge Sep 22, 2023 Acoustic echo cancellation Speech Enhancement
Code Code Available 25 A Lightweight Hybrid Dual Channel Speech Enhancement System under Low-SNR Conditions May 26, 2025 Speech Enhancement
Code Code Available 25 LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT Oct 7, 2023 Audio captioning Automatic Speech Recognition
Code Code Available 25 CoGenAV: Versatile Audio-Visual Representation Learning via Contrastive-Generative Synchronization May 6, 2025 Active Speaker Detection Audio-Visual Speech Recognition
Code Code Available 25 Conditional Diffusion Probabilistic Model for Speech Enhancement Feb 10, 2022 model Speech Enhancement
Code Code Available 25 Direction-Aware Adaptive Online Neural Speech Enhancement with an Augmented Reality Headset in Real Noisy Conversational Environments Jul 15, 2022 blind source separation Speech Enhancement
Code Code Available 25 MambAttention: Mamba with Multi-Head Attention for Generalizable Single-Channel Speech Enhancement Jul 1, 2025 Automatic Speech Recognition Mamba
Code Code Available 25 CMGAN: Conformer-based Metric GAN for Speech Enhancement Mar 28, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 25 Fast FullSubNet: Accelerate Full-band and Sub-band Fusion Model for Single-channel Speech Enhancement Dec 18, 2022 Computational Efficiency Speech Enhancement
Code Code Available 25 Real Time Speech Enhancement in the Waveform Domain Jun 23, 2020 CPU Data Augmentation
Code Code Available 25 Proximal Policy Optimization Algorithms Jul 20, 2017 Continuous Control Dota 2
Code Code Available 25