SOTAVerified

Speech Denoising

Obtain the clean speech of the target speaker by suppressing the background noise. Recent representative github platform ClearerVoice-Studio

Papers

Showing 150 of 65 papers

TitleStatusHype
WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech ProcessingCode3
VoiceFixer: A Unified Framework for High-Fidelity Speech RestorationCode3
Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech EnhancementCode2
Towards Ultra-Low-Power Neuromorphic Speech Enhancement with Spiking-FullSubNetCode2
Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency LossesCode2
Speech Denoising in the Waveform Domain with Self-AttentionCode2
FRA-RIR: Fast Random Approximation of the Image-source MethodCode2
CMGAN: Conformer-Based Metric-GAN for Monaural Speech EnhancementCode2
WaveCRN: An Efficient Convolutional Recurrent Neural Network for End-to-end Speech EnhancementCode1
Self-Supervised Speech Denoising Using Only Noisy Audio SignalsCode1
BirdSoundsDenoising: Deep Visual Audio Denoising for Bird SoundsCode1
Speech Denoising with Auditory ModelsCode1
Speech Denoising with Deep Feature LossesCode1
Speech Denoising Without Clean Training Data: A Noise2Noise ApproachCode1
Speech Denoising by Accumulating Per-Frequency Modeling FluctuationsCode1
A Modulation-Domain Loss for Neural-Network-based Real-time Speech EnhancementCode1
Listening to Sounds of Silence for Speech DenoisingCode1
Miipher: A Robust Speech Restoration Model Integrating Self-Supervised Speech and Text RepresentationsCode1
Visual Speech Enhancement Without A Real Visual StreamCode1
CleanUMamba: A Compact Mamba Network for Speech Denoising using Channel PruningCode1
Phase-aware Single-stage Speech Denoising and Dereverberation with U-NetCode0
Investigating the effect of residual and highway connections in speech enhancement modelsCode0
Joint Optimization of Masks and Deep Recurrent Neural Networks for Monaural Source SeparationCode0
Sparse Mixture of Local Experts for Efficient Speech EnhancementCode0
Speech Denoising Convolutional Neural Network trained with Deep Feature Losses.Code0
Speech Dereverberation with a Reverberation Time Shortening TargetCode0
Let SSMs be ConvNets: State-space Modeling with Optimal Tensor ContractionsCode0
Supervised and Unsupervised Speech Enhancement Using Nonnegative Matrix FactorizationCode0
Task-specific Optimization of Virtual Channel Linear Prediction-based Speech Dereverberation Front-End for Far-Field Speaker VerificationCode0
Boosted Locality Sensitive Hashing: Discriminative Binary Codes for Source SeparationCode0
Perceptual Loss based Speech Denoising with an ensemble of Audio Pattern Recognition and Self-Supervised ModelsCode0
TSTNN: Two-stage Transformer based Neural Network for Speech Enhancement in the Time Domain0
uSee: Unified Speech Enhancement and Editing with Conditional Diffusion Models0
Active Speech Enhancement: Active Speech Denoising Decliping and Deveraberation0
Zero-Shot Personalized Speech Enhancement through Speaker-Informed Model Selection0
A Multi-Stage Triple-Path Method for Speech Separation in Noisy and Reverberant Environments0
Analysing Diffusion-based Generative Approaches versus Discriminative Approaches for Speech Restoration0
An Investigation of Noise Robustness for Flow-Matching-Based Zero-Shot TTS0
Are Deep Speech Denoising Models Robust to Adversarial Noise?0
A Review of Multi-Objective Deep Learning Speech Denoising Methods0
CleanUNet 2: A Hybrid Speech Denoising Model on Waveform and Spectrogram0
Complexity Scaling for Speech Denoising0
Deep Speech Denoising with Vector Space Projections0
DENOASR: Debiasing ASRs through Selective Denoising0
FB-MSTCN: A Full-Band Single-Channel Speech Enhancement Method Based on Multi-Scale Temporal Convolutional Network0
Generative Speech Foundation Model Pretraining for High-Quality Speech Extraction and Restoration0
Incorporating Multi-Target in Multi-Stage Speech Enhancement Model for Better Generalization0
Knowledge Distillation for Speech Denoising by Latent Representation Alignment with Cosine Distance0
Learning robust speech representation with an articulatory-regularized variational autoencoder0
Multi-Channel Speech Denoising for Machine Ears0
Show:102550
← PrevPage 1 of 2Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1CBAK2.41Unverified
#ModelMetricClaimedVerifiedStatus
1CBAK2.47Unverified