| Anatomy of Industrial Scale Multilingual ASR | Apr 15, 2024 | AnatomyAutomatic Speech Recognition | —Unverified | 0 |
| Resilience of Large Language Models for Noisy Instructions | Apr 15, 2024 | Automatic Speech RecognitionOptical Character Recognition | —Unverified | 0 |
| Automatic Speech Recognition Advancements for Indigenous Languages of the Americas | Apr 12, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Comparing Apples to Oranges: LLM-powered Multimodal Intention Prediction in an Object Categorization Task | Apr 12, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| An Effective Automated Speaking Assessment Approach to Mitigating Data Scarcity and Imbalanced Distribution | Apr 11, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Conformer-1: Robust ASR via Large-Scale Semisupervised Bootstrapping | Apr 10, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| The X-LANCE Technical Report for Interspeech 2024 Speech Processing Using Discrete Speech Unit Challenge | Apr 9, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Transducers with Pronunciation-aware Embeddings for Automatic Speech Recognition | Apr 4, 2024 | Automatic Speech RecognitionDecoder | —Unverified | 0 |
| Mai Ho'omāuna i ka 'Ai: Language Models Improve Automatic Speech Recognition in Hawaiian | Apr 3, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Kallaama: A Transcribed Speech Dataset about Agriculture in the Three Most Widely Spoken Languages in Senegal | Apr 2, 2024 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 1 |
| Transfer Learning from Whisper for Microscopic Intelligibility Prediction | Apr 2, 2024 | Automatic Speech RecognitionDeep Learning | —Unverified | 0 |
| Noise Masking Attacks and Defenses for Pretrained Speech Models | Apr 2, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Houston we have a Divergence: A Subgroup Performance Analysis of ASR Models | Mar 31, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| ELITR-Bench: A Meeting Assistant Benchmark for Long-Context Language Models | Mar 29, 2024 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 0 |
| Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition | Mar 28, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| LV-CTC: Non-autoregressive ASR with CTC and latent variable models | Mar 28, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| PhoWhisper: Automatic Speech Recognition for Vietnamese | Mar 27, 2024 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 3 |
| ZAEBUC-Spoken: A Multilingual Multidialectal Arabic-English Speech Corpus | Mar 27, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Extracting Biomedical Entities from Noisy Audio Transcripts | Mar 26, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| DANCER: Entity Description Augmented Named Entity Corrector for Automatic Speech Recognition | Mar 26, 2024 | Automatic Speech RecognitionLanguage Modelling | —Unverified | 0 |
| Hierarchical Recurrent Adapters for Efficient Multi-Task Adaptation of Large Speech Models | Mar 25, 2024 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| A Multimodal Approach to Device-Directed Speech Detection with Large Language Models | Mar 21, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Isometric Neural Machine Translation using Phoneme Count Ratio Reward-based Reinforcement Learning | Mar 20, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| BanglaNum -- A Public Dataset for Bengali Digit Recognition from Speech | Mar 20, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| AdaMER-CTC: Connectionist Temporal Classification with Adaptive Maximum Entropy Regularization for Automatic Speech Recognition | Mar 18, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Artificial Intelligence for Cochlear Implants: Review of Strategies, Challenges, and Perspectives | Mar 17, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Automatic Speech Recognition (ASR) for the Diagnosis of pronunciation of Speech Sound Disorders in Korean children | Mar 13, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| SpeechColab Leaderboard: An Open-Source Platform for Automatic Speech Recognition Evaluation | Mar 13, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 4 |
| Skipformer: A Skip-and-Recover Strategy for Efficient Speech Recognition | Mar 13, 2024 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Gujarati-English Code-Switching Speech Recognition using ensemble prediction of spoken language | Mar 12, 2024 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| The evaluation of a code-switched Sepedi-English automatic speech recognition system | Mar 11, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| SCORE: Self-supervised Correspondence Fine-tuning for Improved Content Representations | Mar 10, 2024 | Automatic Speech RecognitionData Augmentation | CodeCode Available | 0 |
| Aligning Speech to Languages to Enhance Code-switching Speech Recognition | Mar 9, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Speech Robust Bench: A Robustness Benchmark For Speech Recognition | Mar 8, 2024 | Adversarial RobustnessAutomatic Speech Recognition | CodeCode Available | 1 |
| Classist Tools: Social Class Correlates with Performance in NLP | Mar 7, 2024 | Automatic Speech RecognitionLanguage Modelling | —Unverified | 0 |
| A New Benchmark for Evaluating Automatic Speech Recognition in the Arabic Call Domain | Mar 7, 2024 | Arabic Speech RecognitionAutomatic Speech Recognition | —Unverified | 0 |
| JEP-KD: Joint-Embedding Predictive Architecture Based Knowledge Distillation for Visual Speech Recognition | Mar 4, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings | Mar 4, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 2 |
| Language and Speech Technology for Central Kurdish Varieties | Mar 4, 2024 | Automatic Speech RecognitionDiversity | CodeCode Available | 1 |
| What has LeBenchmark Learnt about French Syntax? | Mar 4, 2024 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| A Closer Look at Wav2Vec2 Embeddings for On-Device Single-Channel Speech Enhancement | Mar 3, 2024 | Automatic Speech RecognitionKeyword Spotting | —Unverified | 0 |
| A Cross-Modal Approach to Silent Speech with LLM-Enhanced Recognition | Mar 2, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Automatic Speech Recognition using Advanced Deep Learning Approaches: A survey | Mar 2, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Post-decoder Biasing for End-to-End Speech Recognition of Multi-turn Medical Interview | Mar 1, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Inappropriate Pause Detection In Dysarthric Speech Using Large-Scale Speech Recognition | Feb 29, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Probing the Information Encoded in Neural-based Acoustic Models of Automatic Speech Recognition Systems | Feb 29, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Twists, Humps, and Pebbles: Multilingual Speech Recognition Models Exhibit Gender Performance Gaps | Feb 28, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Exploration of Adapter for Noise Robust Automatic Speech Recognition | Feb 28, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Extreme Encoder Output Frame Rate Reduction: Improving Computational Latencies of Large End-to-End Models | Feb 27, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| An Effective Mixture-Of-Experts Approach For Code-Switching Speech Recognition Leveraging Encoder Disentanglement | Feb 27, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |