4-bit Conformer with Native Quantization Aware Training for Speech Recognition Mar 29, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 2CMGAN: Conformer-based Metric GAN for Speech Enhancement Mar 28, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 2Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction Jan 5, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 2Robust Self-Supervised Audio-Visual Speech Recognition Jan 5, 2022 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 2Fast Transformers with Clustered Attention Jul 9, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 2Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities May 23, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1From Tens of Hours to Tens of Thousands: Scaling Back-Translation for Speech Recognition May 22, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1DuplexMamba: Enhancing Real-time Speech Conversations with Duplex and Streaming Capabilities Feb 16, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1VINP: Variational Bayesian Inference with Neural Speech Prior for Joint ASR-Effective Speech Dereverberation and Blind RIR Identification Feb 11, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation Models Feb 9, 2025 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 1Sagalee: an Open Source Automatic Speech Recognition Dataset for Oromo Language Feb 1, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1FlanEC: Exploring Flan-T5 for Post-ASR Error Correction Jan 22, 2025 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1MathSpeech: Leveraging Small LMs for Accurate Conversion in Mathematical Speech-to-Formula Dec 20, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Enhancing Multimodal Sentiment Analysis for Missing Modality through Self-Distillation and Unified Modality Cross-Attention Oct 19, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1VHASR: A Multimodal Speech Recognition System With Vision Hotwords Oct 1, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Mamba for Streaming ASR Combined with Unimodal Aggregation Sep 30, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1LI-TTA: Language Informed Test-Time Adaptation for Automatic Speech Recognition Aug 11, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1ALIF: Low-Cost Adversarial Audio Attacks on Black-Box Speech Platforms using Linguistic Features Aug 3, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Evolutionary Prompt Design for LLM-Based Post-ASR Error Correction Jul 23, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Framework for Curating Speech Datasets and Evaluating ASR Systems: A Case Study for Polish Jul 18, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Vibravox: A Dataset of French Speech Captured with Body-conduction Audio Sensors Jul 16, 2024 Automatic Phoneme Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Controlling Whisper: Universal Acoustic Adversarial Attacks to Control Speech Foundation Models Jul 5, 2024 Adversarial Attack Automatic Speech Recognition
Code Code Available 1Improving Self-supervised Pre-training using Accent-Specific Codebooks Jul 4, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Pinyin Regularization in Error Correction for Chinese Speech Recognition with Large Language Models Jul 2, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1ArzEn-LLM: Code-Switched Egyptian Arabic-English Translation and Speech Recognition Using LLMs Jun 26, 2024 ArzEn Code-switched Translation to ara ArzEn Code-switched Translation to eng
Code Code Available 1Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNet Jun 25, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Continual Test-time Adaptation for End-to-end Speech Recognition on Noisy Speech Jun 16, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition Jun 6, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition May 27, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1SoccerNet-Echoes: A Soccer Game Audio Commentary Dataset May 12, 2024 Action Spotting Automatic Speech Recognition
Code Code Available 1Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models May 9, 2024 Adversarial Attack Automatic Speech Recognition
Code Code Available 1Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets May 3, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Less Peaky and More Accurate CTC Forced Alignment by Label Priors Apr 22, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Speech Robust Bench: A Robustness Benchmark For Speech Recognition Mar 8, 2024 Adversarial Robustness Automatic Speech Recognition
Code Code Available 1A Cross-Modal Approach to Silent Speech with LLM-Enhanced Recognition Mar 2, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition Feb 8, 2024 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 1REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR Feb 6, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Word-Level ASR Quality Estimation for Efficient Corpus Sampling and Post-Editing through Analyzing Attentions of a Reference-Free Metric Jan 20, 2024 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Extending Whisper with prompt tuning to target-speaker ASR Dec 13, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1D4AM: A General Denoising Framework for Downstream Acoustic Models Nov 28, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation Nov 9, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Improved Child Text-to-Speech Synthesis through Fastpitch-based Transfer Learning Nov 7, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Multilingual DistilWhisper: Efficient Distillation of Multi-task Speech Models via Language-Specific Experts Nov 2, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Automatic Disfluency Detection from Untranscribed Speech Nov 1, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Developing a Multilingual Dataset and Evaluation Metrics for Code-Switching: A Focus on Hong Kong's Polylingual Dynamics Oct 27, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1CL-MASR: A Continual Learning Benchmark for Multilingual ASR Oct 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1ArTST: Arabic Text and Speech Transformer Oct 25, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Accented Speech Recognition With Accent-specific Codebooks Oct 24, 2023 Accented Speech Recognition Automatic Speech Recognition
Code Code Available 1Advancing Test-Time Adaptation in Wild Acoustic Test Settings Oct 14, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Speech collage: code-switched audio generation by collaging monolingual corpora Sep 27, 2023 Audio Generation Automatic Speech Recognition
Code Code Available 1