SOTAVerified

Speech Denoising

Obtain the clean speech of the target speaker by suppressing the background noise. Recent representative github platform ClearerVoice-Studio

Papers

Showing 150 of 65 papers

TitleStatusHype
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech ProcessingCode3
VoiceFixer: A Unified Framework for High-Fidelity Speech RestorationCode3
Towards Ultra-Low-Power Neuromorphic Speech Enhancement with Spiking-FullSubNetCode2
Speech Denoising in the Waveform Domain with Self-AttentionCode2
Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency LossesCode2
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech EnhancementCode2
CMGAN: Conformer-Based Metric-GAN for Monaural Speech EnhancementCode2
FRA-RIR: Fast Random Approximation of the Image-source MethodCode2
Miipher: A Robust Speech Restoration Model Integrating Self-Supervised Speech and Text RepresentationsCode1
WaveCRN: An Efficient Convolutional Recurrent Neural Network for End-to-end Speech EnhancementCode1
Self-Supervised Speech Denoising Using Only Noisy Audio SignalsCode1
Speech Denoising with Auditory ModelsCode1
BirdSoundsDenoising: Deep Visual Audio Denoising for Bird SoundsCode1
Speech Denoising with Deep Feature LossesCode1
Speech Denoising Without Clean Training Data: A Noise2Noise ApproachCode1
Listening to Sounds of Silence for Speech DenoisingCode1
CleanUMamba: A Compact Mamba Network for Speech Denoising using Channel PruningCode1
A Modulation-Domain Loss for Neural-Network-based Real-time Speech EnhancementCode1
Speech Denoising by Accumulating Per-Frequency Modeling FluctuationsCode1
Visual Speech Enhancement Without A Real Visual StreamCode1
Analysing Diffusion-based Generative Approaches versus Discriminative Approaches for Speech RestorationCode0
Boosted Locality Sensitive Hashing: Discriminative Binary Codes for Source SeparationCode0
Generative Speech Foundation Model Pretraining for High-Quality Speech Extraction and RestorationCode0
Investigating the effect of residual and highway connections in speech enhancement modelsCode0
Sparse Mixture of Local Experts for Efficient Speech EnhancementCode0
Joint Optimization of Masks and Deep Recurrent Neural Networks for Monaural Source SeparationCode0
Speech Denoising Convolutional Neural Network trained with Deep Feature Losses.Code0
Speech Dereverberation with a Reverberation Time Shortening TargetCode0
Phase-aware Single-stage Speech Denoising and Dereverberation with U-NetCode0
Let SSMs be ConvNets: State-space Modeling with Optimal Tensor ContractionsCode0
Supervised and Unsupervised Speech Enhancement Using Nonnegative Matrix FactorizationCode0
Task-specific Optimization of Virtual Channel Linear Prediction-based Speech Dereverberation Front-End for Far-Field Speaker VerificationCode0
Perceptual Loss based Speech Denoising with an ensemble of Audio Pattern Recognition and Self-Supervised ModelsCode0
Zero-Shot Personalized Speech Enhancement through Speaker-Informed Model Selection0
A Multi-Stage Triple-Path Method for Speech Separation in Noisy and Reverberant Environments0
An Investigation of Noise Robustness for Flow-Matching-Based Zero-Shot TTS0
Are Deep Speech Denoising Models Robust to Adversarial Noise?0
A Review of Multi-Objective Deep Learning Speech Denoising Methods0
CleanUNet 2: A Hybrid Speech Denoising Model on Waveform and Spectrogram0
Complexity Scaling for Speech Denoising0
Deep Speech Denoising with Vector Space Projections0
DENOASR: Debiasing ASRs through Selective Denoising0
FB-MSTCN: A Full-Band Single-Channel Speech Enhancement Method Based on Multi-Scale Temporal Convolutional Network0
Incorporating Multi-Target in Multi-Stage Speech Enhancement Model for Better Generalization0
Knowledge Distillation for Speech Denoising by Latent Representation Alignment with Cosine Distance0
Learning robust speech representation with an articulatory-regularized variational autoencoder0
Multi-Channel Speech Denoising for Machine Ears0
Perceptual-based deep-learning denoiser as a defense against adversarial attacks on ASR systems0
AeGAN: Time-Frequency Speech Denoising via Generative Adversarial Networks0
aTENNuate: Optimized Real-time Speech Enhancement with Deep SSMs on Raw Audio0
Show:102550
← PrevPage 1 of 2Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CBAK2.41Unverified
#ModelMetricClaimedVerifiedStatus
1CBAK2.47Unverified