BASPRO: a balanced script producer for speech corpus collection based on the genetic algorithm (Dec 11, 2022). Tasks: Automatic Speech Recognition (ASR). Code available.
SpeechLMScore: Evaluating speech generation using speech language model (Dec 8, 2022). Tasks: Language Modeling. Code available.
High Fidelity Speech Enhancement with Band-split RNN (Dec 1, 2022). Tasks: Speech Enhancement, Vocal Bursts Intensity Prediction. Code available.
McNet: Fuse Multiple Cues for Multichannel Speech Enhancement (Nov 16, 2022). Tasks: Speech Enhancement. Code available.
SceneFake: An Initial Dataset and Benchmarks for Scene Fake Audio Detection (Nov 11, 2022). Tasks: Speech Enhancement. Code available.
Inference and Denoise: Causal Inference-based Neural Speech Enhancement (Nov 2, 2022). Tasks: Causal Inference, Speech Enhancement. Code available.
Diffusion-based Generative Speech Source Separation (Oct 31, 2022). Tasks: Speech Enhancement. Code available.
Diffiner: A Versatile Diffusion-based Generative Refiner for Speech Enhancement (Oct 27, 2022). Tasks: Denoising, Speech Enhancement. Code available.
MMS-MSG: A Multi-purpose Multi-Speaker Mixture Signal Generator (Sep 23, 2022). Tasks: Speech Enhancement. Code available.
Improving Speech Enhancement through Fine-Grained Speech Characteristics (Jul 1, 2022). Tasks: Deep Learning, Speech Enhancement. Code available.
A light-weight full-band speech enhancement model (Jun 29, 2022). Tasks: Speech Enhancement. Code available.
Insights Into Deep Non-linear Filters for Improved Multi-channel Speech Enhancement (Jun 27, 2022). Tasks: Speech Enhancement. Code available.
A Systematic Comparison of Phonetic Aware Techniques for Speech Enhancement (Jun 22, 2022). Tasks: Automatic Speech Recognition (ASR). Code available.
On the Role of Spatial, Spectral, and Temporal Processing for DNN-based Non-linear Multi-channel Speech Enhancement (Jun 22, 2022). Tasks: Speech Enhancement, Speech Extraction. Code available.
Universal Speech Enhancement with Score-based Diffusion (Jun 7, 2022). Tasks: Speech Enhancement. Code available.
U-Former: Improving Monaural Speech Enhancement with Multi-head Self and Cross Attention (May 18, 2022). Tasks: Decoder, Speech Enhancement. Code available.
Boosting Self-Supervised Embeddings for Speech Enhancement (Apr 7, 2022). Tasks: Self-Supervised Learning, Speech Enhancement. Code available.
Audio-Visual Speech Codecs: Rethinking Audio-Visual Speech Enhancement by Re-Synthesis (Mar 31, 2022). Tasks: Speech Enhancement. Code available.
Perceptual Contrast Stretching on Target Feature for Speech Enhancement (Mar 31, 2022). Tasks: Speech Enhancement. Code available.
Speech Enhancement with Score-Based Generative Models in the Complex STFT Domain (Mar 31, 2022). Tasks: Speech Enhancement. Code available.
Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition (Mar 28, 2022). Tasks: Automatic Speech Recognition (ASR). Code available.
HiFi++: a Unified Framework for Bandwidth Extension and Speech Enhancement (Mar 24, 2022). Tasks: Audio Generation, Bandwidth Extension. Code available.
MANNER: Multi-view Attention Network for Noise Erasure (Mar 4, 2022). Tasks: Decoder, Speech Enhancement. Code available.
Look&Listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement (Mar 4, 2022). Tasks: Active Speaker Detection, Multi-Task Learning. Code available.
L3DAS22 Challenge: Learning 3D Audio Sources in a Real Office Environment (Feb 21, 2022). Tasks: Sound Event Localization and Detection, Speech Enhancement. Code available.
RemixIT: Continual self-training of speech enhancement models via bootstrapped remixing (Feb 17, 2022). Tasks: Domain Adaptation, Speech Enhancement. Code available.
HGCN: Harmonic gated compensation network for speech enhancement (Jan 30, 2022). Tasks: Action Detection, Activity Detection. Code available.
Towards Intelligibility-Oriented Audio-Visual Speech Enhancement (Nov 18, 2021). Tasks: Speech Enhancement. Code available.
MultiSV: Dataset for Far-Field Multi-Channel Speaker Verification (Nov 11, 2021). Tasks: Denoising, Speaker Verification. Code available.
Unsupervised Noise Adaptive Speech Enhancement by Discriminator-Constrained Optimal Transport (Nov 11, 2021). Tasks: Domain Adaptation, Speech Enhancement. Code available.
Uformer: A Unet based dilated complex & real dual-path conformer network for simultaneous speech enhancement and dereverberation (Nov 11, 2021). Tasks: Decoder, Speech Enhancement. Code available.
Deep Learning-based Non-Intrusive Multi-Objective Speech Assessment Model with Cross-Domain Features (Nov 3, 2021). Tasks: Prediction, Speech Enhancement. Code available.
Continual self-training with bootstrapped remixing for speech enhancement (Oct 19, 2021). Tasks: Domain Adaptation, Speech Enhancement. Code available.
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing (Oct 14, 2021). Tasks: Automatic Speech Recognition (ASR). Code available.
Toward Degradation-Robust Voice Conversion (Oct 14, 2021). Tasks: Denoising, Speech Enhancement. Code available.
Dual-branch Attention-In-Attention Transformer for single-channel speech enhancement (Oct 13, 2021). Tasks: Speech Enhancement. Code available.
MetricGAN-U: Unsupervised speech enhancement/ dereverberation based only on noisy/ reverberated speech (Oct 12, 2021). Tasks: Speech Enhancement. Code available.
Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition (Oct 11, 2021). Tasks: Automatic Speech Recognition (ASR). Code available.
NORESQA: A Framework for Speech Quality Assessment using Non-Matching References (Sep 16, 2021). Tasks: Speech Enhancement. Code available.
A Deep Learning Loss Function based on Auditory Power Compression for Speech Enhancement (Aug 26, 2021). Tasks: Speech Enhancement. Code available.
Complex-valued Spatial Autoencoders for Multichannel Speech Enhancement (Aug 6, 2021). Tasks: Speech Enhancement. Code available.
A Causal U-net based Neural Beamforming Network for Real-Time Multi-Channel Speech Enhancement (Aug 1, 2021). Tasks: CPU, Speech Enhancement. Code available.
Microphone Array Generalization for Multichannel Narrowband Deep Speech Enhancement (Jul 27, 2021). Tasks: Speech Enhancement. Code available.
A Study on Speech Enhancement Based on Diffusion Probabilistic Model (Jul 25, 2021). Tasks: Speech Enhancement. Code available.
Multi-Task Audio Source Separation (Jul 14, 2021). Tasks: Audio Source Separation, Multi-task Audio Source Separation. Code available.
EasyCom: An Augmented Reality Dataset to Support Algorithms for Easy Communication in Noisy Environments (Jul 9, 2021). Tasks: Speech Enhancement. Code available.
TENET: A Time-reversal Enhancement Network for Noise-robust ASR (Jul 4, 2021). Tasks: Automatic Speech Recognition (ASR). Code available.
Unsupervised Speech Enhancement using Dynamical Variational Auto-Encoders (Jun 23, 2021). Tasks: Representation Learning, Speech Enhancement. Code available.
MeshRIR: A Dataset of Room Impulse Responses on Meshed Grid Points For Evaluating Sound Field Analysis and Synthesis Methods (Jun 21, 2021). Tasks: Distant Speech Recognition, Room Impulse Response (RIR). Code available.
Attention-based distributed speech enhancement for unconstrained microphone arrays with varying number of nodes (Jun 15, 2021). Tasks: Speech Enhancement. Code available.