Schrödinger Bridge for Generative Speech Enhancement | Jul 22, 2024 | Denoising, Speech Denoising | Unverified, 0
Wideband Relative Transfer Function (RTF) Estimation Exploiting Frequency Correlations | Jul 19, 2024 | Fault Detection, Speech Enhancement | Code available, 0
Vibravox: A Dataset of French Speech Captured with Body-conduction Audio Sensors | Jul 16, 2024 | Automatic Phoneme Recognition, Automatic Speech Recognition (ASR) | Code available, 1
RT-LA-VocE: Real-Time Low-SNR Audio-Visual Speech Enhancement | Jul 10, 2024 | Speech Enhancement | Unverified, 0
Unsupervised Face-Masked Speech Enhancement Using Generative Adversarial Networks With Human-in-the-Loop Assessment Metrics | Jul 2, 2024 | Speech Enhancement | Unverified, 0
Open-Source Conversational AI with SpeechBrain 1.0 | Jun 29, 2024 | Language Modeling, Language Modelling | Unverified, 0
Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection from Speech in Real-World Operative Conditions | Jun 23, 2024 | Audio Classification, Parkinson Detection from Speech | Code available, 1
DASB -- Discrete Audio and Speech Benchmark | Jun 20, 2024 | Benchmarking, Emotion Recognition | Unverified, 0
Diffusion-based Generative Modeling with Discriminative Guidance for Streamable Speech Enhancement | Jun 19, 2024 | Speech Enhancement | Unverified, 0
Universal Score-based Speech Enhancement with High Content Preservation | Jun 18, 2024 | Speech Enhancement | Code available, 2
Spatially constrained vs. unconstrained filtering in neural spatiospectral filters for multichannel speech enhancement | Jun 17, 2024 | Speech Enhancement | Unverified, 0
An Exploration of Length Generalization in Transformer-Based Speech Enhancement | Jun 17, 2024 | Position, Speech Enhancement | Unverified, 0
AV-CrossNet: an Audiovisual Complex Spectral Mapping Network for Speech Separation By Leveraging Narrow- and Cross-Band Modeling | Jun 17, 2024 | Speaker Separation, Speech Enhancement | Code available, 1
Personalized Speech Enhancement Without a Separate Speaker Embedding Model | Jun 14, 2024 | Speech Enhancement | Unverified, 0
FlowAVSE: Efficient Audio-Visual Speech Enhancement with Conditional Flow Matching | Jun 13, 2024 | Speech Enhancement | Unverified, 0
Comparative Analysis of Personalized Voice Activity Detection Systems: Assessing Real-World Effectiveness | Jun 12, 2024 | Action Detection, Activity Detection | Unverified, 0
Pre-training Feature Guided Diffusion Model for Speech Enhancement | Jun 11, 2024 | Speech Enhancement | Unverified, 0
The Effect of Training Dataset Size on Discriminative and Diffusion-Based Speech Enhancement Systems | Jun 10, 2024 | Diversity, Image Generation | Unverified, 0
EARS: An Anechoic Fullband Speech Dataset Benchmarked for Speech Enhancement and Dereverberation | Jun 10, 2024 | Speech Enhancement | Code available, 3
Thunder : Unified Regression-Diffusion Speech Enhancement with a Single Reverse Step using Brownian Bridge | Jun 10, 2024 | regression, Speech Enhancement | Unverified, 0
An Investigation of Noise Robustness for Flow-Matching-Based Zero-Shot TTS | Jun 9, 2024 | Denoising, Speech Denoising | Unverified, 0
URGENT Challenge: Universality, Robustness, and Generalizability For Speech Enhancement | Jun 7, 2024 | Bandwidth Extension, Denoising | Unverified, 0
Flexible Multichannel Speech Enhancement for Noise-Robust Frontend | Jun 6, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified, 0
Helsinki Speech Challenge 2024 | Jun 6, 2024 | Speech Enhancement, speech-recognition | Unverified, 0
Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enhancement | Jun 6, 2024 | Diversity, Speech Enhancement | Code available, 1
PLDNet: PLD-Guided Lightweight Deep Network Boosted by Efficient Attention for Handheld Dual-Microphone Speech Enhancement | Jun 6, 2024 | Speech Enhancement | Unverified, 0
Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment | Jun 5, 2024 | Attribute, Speech Enhancement | Code available, 1
Reference Channel Selection by Multi-Channel Masking for End-to-End Multi-Channel Speech Enhancement | Jun 5, 2024 | channel selection, Speech Enhancement | Unverified, 0
The PESQetarian: On the Relevance of Goodhart's Law for Speech Enhancement | Jun 5, 2024 | Speech Enhancement | Unverified, 0
Speech enhancement deep-learning architecture for efficient edge processing | May 27, 2024 | Deep Learning, Generative Adversarial Network | Unverified, 0
A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition | May 27, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code available, 1
Non-autoregressive real-time Accent Conversion model with voice cloning | May 21, 2024 | Speech Enhancement, speech-recognition | Unverified, 0
Mamba in Speech: Towards an Alternative to Self-Attention | May 21, 2024 | Mamba, Speech Enhancement | Code available, 2
Monaural speech enhancement on drone via Adapter based transfer learning | May 16, 2024 | Speech Enhancement, Transfer Learning | Unverified, 0
Building a Luganda Text-to-Speech Model From Crowdsourced Data | May 16, 2024 | Speech Enhancement, text-to-speech | Unverified, 0
Evaluating Speech Enhancement Systems Through Listening Effort | May 13, 2024 | Speech Enhancement | Unverified, 0
An Investigation of Incorporating Mamba for Speech Enhancement | May 10, 2024 | Mamba, Speech Enhancement | Code available, 3
Real-time multichannel deep speech enhancement in hearing aids: Comparing monaural and binaural processing in complex acoustic scenarios | May 3, 2024 | Deep Learning, Speech Enhancement | Unverified, 0
TRAMBA: A Hybrid Transformer and Mamba Architecture for Practical Audio and Bone Conduction Speech Super Resolution and Enhancement on Mobile and Wearable Platforms | May 2, 2024 | Mamba, Speech Enhancement | Unverified, 0
Deep low-latency joint speech transmission and enhancement over a gaussian channel | Apr 30, 2024 | Decoder, Speech Enhancement | Unverified, 0
Rethinking Processing Distortions: Disentangling the Impact of Speech Enhancement Errors on Speech Recognition Performance | Apr 23, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified, 0
Exploring the Potential of Data-Driven Spatial Audio Enhancement Using a Single-Channel Model | Apr 22, 2024 | Direction of Arrival Estimation, Speech Enhancement | Unverified, 0
TRNet: Two-level Refinement Network leveraging Speech Enhancement for Noise Robust Speech Emotion Recognition | Apr 19, 2024 | Emotion Recognition, Speech Emotion Recognition | Unverified, 0
FSPEN: AN ULTRA-LIGHTWEIGHT NETWORK FOR REAL TIME SPEECH ENAHNCMENT | Apr 15, 2024 | Speech Enhancement | Code available, 2
Efficient High-Performance Bark-Scale Neural Network for Residual Echo and Noise Suppression | Apr 8, 2024 | Speech Enhancement | Unverified, 0
Artificial Intelligence for Cochlear Implants: Review of Strategies, Challenges, and Perspectives | Mar 17, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified, 0
SuperM2M: Supervised and Mixture-to-Mixture Co-Learning for Speech Enhancement and Noise-Robust ASR | Mar 15, 2024 | Speaker Separation, Speech Enhancement | Unverified, 0
How to train your ears: Auditory-model emulation for large-dynamic-range inputs and mild-to-severe hearing losses | Mar 15, 2024 | Speech Enhancement | Code available, 0
Binaural Speech Enhancement Using Deep Complex Convolutional Transformer Networks | Mar 8, 2024 | Decoder, Speech Enhancement | Code available, 1
A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement | Mar 3, 2024 | Automatic Speech Recognition, Keyword Spotting | Unverified, 0