| Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition | Mar 28, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | —Unverified | 0 |
| LV-CTC: Non-autoregressive ASR with CTC and latent variable models | Mar 28, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | —Unverified | 0 |
| ZAEBUC-Spoken: A Multilingual Multidialectal Arabic-English Speech Corpus | Mar 27, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | —Unverified | 0 |
| DANCER: Entity Description Augmented Named Entity Corrector for Automatic Speech Recognition | Mar 26, 2024 | Automatic Speech Recognition, Language Modelling | —Unverified | 0 |
| Extracting Biomedical Entities from Noisy Audio Transcripts | Mar 26, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | —Unverified | 0 |
| Hierarchical Recurrent Adapters for Efficient Multi-Task Adaptation of Large Speech Models | Mar 25, 2024 | Automatic Speech Recognition, speech-recognition | —Unverified | 0 |
| A Multimodal Approach to Device-Directed Speech Detection with Large Language Models | Mar 21, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | —Unverified | 0 |
| BanglaNum -- A Public Dataset for Bengali Digit Recognition from Speech | Mar 20, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | —Unverified | 0 |
| Isometric Neural Machine Translation using Phoneme Count Ratio Reward-based Reinforcement Learning | Mar 20, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | —Unverified | 0 |
| AdaMER-CTC: Connectionist Temporal Classification with Adaptive Maximum Entropy Regularization for Automatic Speech Recognition | Mar 18, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | —Unverified | 0 |