| LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition | Jun 6, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Joint Beam Search Integrating CTC, Attention, and Transducer Decoders | Jun 5, 2024 | Automatic Speech RecognitionDecoder | —Unverified | 0 |
| Error-preserving Automatic Speech Recognition of Young English Learners' Language | Jun 5, 2024 | Automatic Speech RecognitionLanguage Modelling | CodeCode Available | 0 |
| Enhancing CTC-based speech recognition with diverse modeling units | Jun 5, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Text Injection for Neural Contextual Biasing | Jun 5, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Task Arithmetic can Mitigate Synthetic-to-Real Gap in Automatic Speech Recognition | Jun 5, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Keyword-Guided Adaptation of Automatic Speech Recognition | Jun 4, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Efficiently Train ASR Models that Memorize Less and Perform Better with Per-core Clipping | Jun 4, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Whistle: Data-Efficient Multilingual and Crosslingual Speech Recognition via Weakly Phonetic Supervision | Jun 4, 2024 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Enabling ASR for Low-Resource Languages: A Comprehensive Dataset Creation Approach | Jun 3, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Wav2Prompt: End-to-End Speech Prompt Generation and Tuning For LLM in Zero and Few-shot Learning | Jun 1, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Zipper: A Multi-Tower Decoder Architecture for Fusing Modalities | May 29, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Intelligent Clinical Documentation: Harnessing Generative AI for Patient-Centric Clinical Note Generation | May 28, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| A Variance-Preserving Interpolation Approach for Diffusion Models with Applications to Single Channel Speech Enhancement and Recognition | May 27, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Federating Dynamic Models using Early-Exit Architectures for Automatic Speech Recognition on Heterogeneous Clients | May 27, 2024 | Automatic Speech RecognitionFederated Learning | CodeCode Available | 0 |
| Denoising LM: Pushing the Limits of Error Correction Models for Speech Recognition | May 24, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Contrastive and Consistency Learning for Neural Noisy-Channel Model in Spoken Language Understanding | May 23, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation Models | May 23, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 3 |
| Let's Fuse Step by Step: A Generative Fusion Decoding Algorithm with LLMs for Multi-modal Text Recognition | May 23, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 2 |
| Joint Optimization of Streaming and Non-Streaming Automatic Speech Recognition with Multi-Decoder and Knowledge Distillation | May 22, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Contextualized Automatic Speech Recognition with Dynamic Vocabulary | May 22, 2024 | Automatic Speech RecognitionLanguage Modeling | —Unverified | 0 |
| You don't understand me!: Comparing ASR results for L1 and L2 speakers of Swedish | May 22, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| FairLENS: Assessing Fairness in Law Enforcement Speech Recognition | May 21, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Listen Again and Choose the Right Answer: A New Paradigm for Automatic Speech Recognition with Large Language Models | May 16, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Continued Pretraining for Domain Adaptation of Wav2vec2.0 in Automatic Speech Recognition for Elementary Math Classroom Settings | May 15, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Towards Evaluating the Robustness of Automatic Speech Recognition Systems via Audio Style Transfer | May 15, 2024 | Adversarial AttackAutomatic Speech Recognition | —Unverified | 0 |
| Sonos Voice Control Bias Assessment Dataset: A Methodology for Demographic Bias Assessment in Voice Assistants | May 14, 2024 | Automatic Speech RecognitionDiversity | —Unverified | 0 |
| SpeechVerse: A Large-scale Generalizable Audio Language Model | May 14, 2024 | Automatic Speech RecognitionBenchmarking | —Unverified | 0 |
| SoccerNet-Echoes: A Soccer Game Audio Commentary Dataset | May 12, 2024 | Action SpottingAutomatic Speech Recognition | CodeCode Available | 1 |
| Lost in Transcription: Identifying and Quantifying the Accuracy Biases of Automatic Speech Recognition Systems Against Disfluent Speech | May 10, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models | May 9, 2024 | Adversarial AttackAutomatic Speech Recognition | CodeCode Available | 1 |
| Open Implementation and Study of BEST-RQ for Speech Processing | May 7, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| MMGER: Multi-modal and Multi-granularity Generative Error Correction with LLM for Joint Accent and Speech Recognition | May 6, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Combining X-Vectors and Bayesian Batch Active Learning: Two-Stage Active Learning Pipeline for Speech Recognition | May 3, 2024 | Active LearningAutomatic Speech Recognition | —Unverified | 0 |
| Unveiling the Potential of LLM-Based ASR on Chinese Open-Source Datasets | May 3, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Sequence-to-sequence models in peer-to-peer learning: A practical application | May 2, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Efficient Compression of Multitask Multilingual Speech Models | May 2, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Improving Membership Inference in ASR Model Auditing with Perturbed Loss Features | May 2, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Does Whisper understand Swiss German? An automatic, qualitative, and human evaluation | Apr 30, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Automatic Speech Recognition System-Independent Word Error Rate Estimation | Apr 25, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| U2++ MoE: Scaling 4.7x parameters with minimal impact on RTF | Apr 25, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Developing Acoustic Models for Automatic Speech Recognition in Swedish | Apr 25, 2024 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Gated Low-rank Adaptation for personalized Code-Switching Automatic Speech Recognition on the low-spec devices | Apr 24, 2024 | Automatic Speech RecognitionCPU | —Unverified | 0 |
| Breaking Walls: Pioneering Automatic Speech Recognition for Central Kurdish: End-to-End Transformer Paradigm | Apr 23, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Rethinking Processing Distortions: Disentangling the Impact of Speech Enhancement Errors on Speech Recognition Performance | Apr 23, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Killkan: The Automatic Speech Recognition Dataset for Kichwa with Morphosyntactic Information | Apr 23, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Less Peaky and More Accurate CTC Forced Alignment by Label Priors | Apr 22, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Semantically Corrected Amharic Automatic Speech Recognition | Apr 20, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Efficient infusion of self-supervised representations in Automatic Speech Recognition | Apr 19, 2024 | Automatic Speech RecognitionDecoder | —Unverified | 0 |
| Artificial Neural Networks to Recognize Speakers Division from Continuous Bengali Speech | Apr 18, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |