SOTAVerified

Speech Denoising

Speech denoising recovers the clean speech of a target speaker by suppressing background noise. A recent representative platform on GitHub is ClearerVoice-Studio.

Papers

Showing 1-50 of 65 papers

| Title | Status | Hype |
| --- | --- | --- |
| Active Speech Enhancement: Active Speech Denoising Decliping and Deveraberation |  | 0 |
| Knowledge Distillation for Speech Denoising by Latent Representation Alignment with Cosine Distance |  | 0 |
| Are Deep Speech Denoising Models Robust to Adversarial Noise? |  | 0 |
| Let SSMs be ConvNets: State-space Modeling with Optimal Tensor Contractions | Code | 0 |
| TouchTTS: An Embarrassingly Simple TTS Framework that Everyone Can Touch |  | 0 |
| DENOASR: Debiasing ASRs through Selective Denoising |  | 0 |
| CleanUMamba: A Compact Mamba Network for Speech Denoising using Channel Pruning | Code | 1 |
| Towards Ultra-Low-Power Neuromorphic Speech Enhancement with Spiking-FullSubNet | Code | 2 |
| Generative Speech Foundation Model Pretraining for High-Quality Speech Extraction and Restoration | Code | 0 |
| aTENNuate: Optimized Real-time Speech Enhancement with Deep SSMs on Raw Audio |  | 0 |
| Schrödinger Bridge for Generative Speech Enhancement |  | 0 |
| An Investigation of Noise Robustness for Flow-Matching-Based Zero-Shot TTS |  | 0 |
| Super Denoise Net: Speech Super Resolution with Noise Cancellation in Low Sampling Rate Noisy Environments |  | 0 |
| uSee: Unified Speech Enhancement and Editing with Conditional Diffusion Models |  | 0 |
| Complexity Scaling for Speech Denoising |  | 0 |
| CleanUNet 2: A Hybrid Speech Denoising Model on Waveform and Spectrogram |  | 0 |
| Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement | Code | 2 |
| Target Speech Extraction with Conditional Diffusion Model |  | 0 |
| A Multi-Stage Triple-Path Method for Speech Separation in Noisy and Reverberant Environments |  | 0 |
| Miipher: A Robust Speech Restoration Model Integrating Self-Supervised Speech and Text Representations | Code | 1 |
| Speech Enhancement with Multi-granularity Vector Quantization |  | 0 |
| Analysing Diffusion-based Generative Approaches versus Discriminative Approaches for Speech Restoration | Code | 0 |
| Speech Dereverberation with a Reverberation Time Shortening Target | Code | 0 |
| BirdSoundsDenoising: Deep Visual Audio Denoising for Bird Sounds | Code | 1 |
| CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement | Code | 2 |
| FRA-RIR: Fast Random Approximation of the Image-source Method | Code | 2 |
| Speech Dereverberation with A Reverberation Time Shortening Target |  | 0 |
| VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration | Code | 3 |
| FB-MSTCN: A Full-Band Single-Channel Speech Enhancement Method Based on Multi-Scale Temporal Convolutional Network |  | 0 |
| The PCG-AIID System for L3DAS22 Challenge: MIMO and MISO convolutional recurrent Network for Multi Channel Speech Enhancement and Speech Recognition |  | 0 |
| Multi-Channel Speech Denoising for Machine Ears |  | 0 |
| Speech Denoising in the Waveform Domain with Self-Attention | Code | 2 |
| Task-specific Optimization of Virtual Channel Linear Prediction-based Speech Dereverberation Front-End for Far-Field Speaker Verification | Code | 0 |
| S-DCCRN: Super Wide Band DCCRN with learnable complex feature for speech enhancement |  | 0 |
| Self-Supervised Speech Denoising Using Only Noisy Audio Signals | Code | 1 |
| WavLM: Large-Scale Self-Supervised Pre-Training for Full Stack Speech Processing | Code | 3 |
| Spark Deficient Gabor Frames for Inverse Problems |  | 0 |
| Perceptual-based deep-learning denoiser as a defense against adversarial attacks on ASR systems |  | 0 |
| Incorporating Multi-Target in Multi-Stage Speech Enhancement Model for Better Generalization |  | 0 |
| Zero-Shot Personalized Speech Enhancement through Speaker-Informed Model Selection |  | 0 |
| Speech Denoising Without Clean Training Data: A Noise2Noise Approach | Code | 1 |
| Learning robust speech representation with an articulatory-regularized variational autoencoder |  | 0 |
| TSTNN: Two-stage Transformer based Neural Network for Speech Enhancement in the Time Domain |  | 0 |
| A Modulation-Domain Loss for Neural-Network-based Real-time Speech Enhancement | Code | 1 |
| Monaural Speech Enhancement with Complex Convolutional Block Attention Module and Joint Time Frequency Losses | Code | 2 |
| Visual Speech Enhancement Without A Real Visual Stream | Code | 1 |
| Speech Denoising with Auditory Models | Code | 1 |
| Improving Speech Enhancement Performance by Leveraging Contextual Broad Phonetic Class Information |  | 0 |
| Perceptual Loss based Speech Denoising with an ensemble of Audio Pattern Recognition and Self-Supervised Models | Code | 0 |
| Listening to Sounds of Silence for Speech Denoising | Code | 1 |

Benchmark Results

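| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 |  | CBAK | 2.41 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 |  | CBAK | 2.47 |  | Unverified |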