| Anatomy of Industrial Scale Multilingual ASR | Apr 15, 2024 | AnatomyAutomatic Speech Recognition | —Unverified | 0 |
| Resilience of Large Language Models for Noisy Instructions | Apr 15, 2024 | Automatic Speech RecognitionOptical Character Recognition | —Unverified | 0 |
| Automatic Speech Recognition Advancements for Indigenous Languages of the Americas | Apr 12, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Comparing Apples to Oranges: LLM-powered Multimodal Intention Prediction in an Object Categorization Task | Apr 12, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| An Effective Automated Speaking Assessment Approach to Mitigating Data Scarcity and Imbalanced Distribution | Apr 11, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Conformer-1: Robust ASR via Large-Scale Semisupervised Bootstrapping | Apr 10, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| The X-LANCE Technical Report for Interspeech 2024 Speech Processing Using Discrete Speech Unit Challenge | Apr 9, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Transducers with Pronunciation-aware Embeddings for Automatic Speech Recognition | Apr 4, 2024 | Automatic Speech RecognitionDecoder | —Unverified | 0 |
| Mai Ho'omāuna i ka 'Ai: Language Models Improve Automatic Speech Recognition in Hawaiian | Apr 3, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Transfer Learning from Whisper for Microscopic Intelligibility Prediction | Apr 2, 2024 | Automatic Speech RecognitionDeep Learning | —Unverified | 0 |
| Kallaama: A Transcribed Speech Dataset about Agriculture in the Three Most Widely Spoken Languages in Senegal | Apr 2, 2024 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 1 |
| Noise Masking Attacks and Defenses for Pretrained Speech Models | Apr 2, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Houston we have a Divergence: A Subgroup Performance Analysis of ASR Models | Mar 31, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| ELITR-Bench: A Meeting Assistant Benchmark for Long-Context Language Models | Mar 29, 2024 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 0 |
| Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition | Mar 28, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| LV-CTC: Non-autoregressive ASR with CTC and latent variable models | Mar 28, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| PhoWhisper: Automatic Speech Recognition for Vietnamese | Mar 27, 2024 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 3 |
| ZAEBUC-Spoken: A Multilingual Multidialectal Arabic-English Speech Corpus | Mar 27, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| DANCER: Entity Description Augmented Named Entity Corrector for Automatic Speech Recognition | Mar 26, 2024 | Automatic Speech RecognitionLanguage Modelling | —Unverified | 0 |
| Extracting Biomedical Entities from Noisy Audio Transcripts | Mar 26, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Hierarchical Recurrent Adapters for Efficient Multi-Task Adaptation of Large Speech Models | Mar 25, 2024 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| A Multimodal Approach to Device-Directed Speech Detection with Large Language Models | Mar 21, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Isometric Neural Machine Translation using Phoneme Count Ratio Reward-based Reinforcement Learning | Mar 20, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| BanglaNum -- A Public Dataset for Bengali Digit Recognition from Speech | Mar 20, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| AdaMER-CTC: Connectionist Temporal Classification with Adaptive Maximum Entropy Regularization for Automatic Speech Recognition | Mar 18, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |