| HENT-SRT: Hierarchical Efficient Neural Transducer with Self-Distillation for Joint Speech Recognition and Translation | Jun 2, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Analyzing the Importance of Blank for CTC-Based Knowledge Distillation | Jun 2, 2025 | Automatic Speech RecognitionKnowledge Distillation | —Unverified | 0 |
| Unsupervised Rhythm and Voice Conversion to Improve ASR on Dysarthric Speech | Jun 2, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Self-Supervised Speech Quality Assessment (S3QA): Leveraging Speech Foundation Models for a Scalable Speech Quality Metric | Jun 2, 2025 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| What do self-supervised speech models know about Dutch? Analyzing advantages of language-specific pre-training | Jun 1, 2025 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 0 |
| GigaAM: Efficient Self-Supervised Learner for Speech Recognition | Jun 1, 2025 | Automatic Speech RecognitionLanguage Modeling | CodeCode Available | 4 |
| DYNAC: Dynamic Vocabulary based Non-Autoregressive Contextualization for Speech Recognition | May 31, 2025 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Causal Structure Discovery for Error Diagnostics of Children's ASR | May 31, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Towards Temporally Explainable Dysarthric Speech Clarity Assessment | May 31, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Vedavani: A Benchmark Corpus for ASR on Vedic Sanskrit Poetry | May 30, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| MSDA: Combining Pseudo-labeling and Self-Supervision for Unsupervised Domain Adaptation in ASR | May 30, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Dynamic Context-Aware Streaming Pretrained Language Model For Inverse Text Normalization | May 30, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Improving Multilingual Speech Models on ML-SUPERB 2.0: Fine-tuning with Data Augmentation and LID-Aware CTC | May 30, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Running Conventional Automatic Speech Recognition on Memristor Hardware: A Simulated Approach | May 30, 2025 | Automatic Speech RecognitionQuantization | —Unverified | 0 |
| Fewer Hallucinations, More Verification: A Three-Stage LLM-Based Framework for ASR Error Correction | May 30, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| BeaverTalk: Oregon State University's IWSLT 2025 Simultaneous Speech Translation System | May 29, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Prompting Whisper for Improved Verbatim Transcription and End-to-end Miscue Detection | May 29, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Contextualized Automatic Speech Recognition with Dynamic Vocabulary Prediction and Activation | May 29, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| NGPU-LM: GPU-Accelerated N-Gram Language Model for Context-Biasing in Greedy ASR Decoding | May 28, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Advancing Hearing Assessment: An ASR-Based Frequency-Specific Speech Test for Diagnosing Presbycusis | May 28, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| PSRB: A Comprehensive Benchmark for Evaluating Persian ASR Systems | May 27, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Loquacious Set: 25,000 Hours of Transcribed and Diverse English Speech Recognition Data for Research and Commercial Use | May 27, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Towards Pretraining Robust ASR Foundation Model with Acoustic-Aware Data Augmentation | May 27, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| In-context Language Learning for Endangered Languages in Speech Recognition | May 26, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| The NaijaVoices Dataset: Cultivating Large-Scale, High-Quality, Culturally-Rich Speech Data for African Languages | May 26, 2025 | Automatic Speech RecognitionDiversity | —Unverified | 0 |
| Robust fine-tuning of speech recognition models via model merging: application to disordered speech | May 26, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Beyond Manual Transcripts: The Potential of Automated Speech Recognition Errors in Improving Alzheimer's Disease Detection | May 26, 2025 | Alzheimer's Disease DetectionAutomatic Speech Recognition | —Unverified | 0 |
| Continuous Learning for Children's ASR: Overcoming Catastrophic Forgetting with Elastic Weight Consolidation and Synaptic Intelligence | May 26, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Mixture of LoRA Experts for Low-Resourced Multi-Accent Automatic Speech Recognition | May 26, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| KIT's Low-resource Speech Translation Systems for IWSLT2025: System Enhancement with Synthetic Data and Model Regularization | May 26, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Exploring Generative Error Correction for Dysarthric Speech Recognition | May 26, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| CHSER: A Dataset and Case Study on Generative Speech Error Correction for Child ASR | May 24, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| VietASR: Achieving Industry-level Vietnamese ASR with 50-hour labeled data and Large-Scale Speech Pretraining | May 23, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| LLM-based Generative Error Correction for Rare Words with Synthetic Data and Phonetic Context | May 23, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| CosyVoice 3: Towards In-the-wild Speech Generation via Scaling-up and Post-training | May 23, 2025 | Automatic Speech RecognitionEmotion Recognition | CodeCode Available | 11 |
| Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities | May 23, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| SoccerChat: Integrating Multimodal Data for Enhanced Soccer Game Understanding | May 22, 2025 | Action ClassificationAutomatic Speech Recognition | CodeCode Available | 0 |
| An Effective Training Framework for Light-Weight Automatic Speech Recognition Models | May 22, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Large Language Models based ASR Error Correction for Child Conversations | May 22, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| From Tens of Hours to Tens of Thousands: Scaling Back-Translation for Speech Recognition | May 22, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Word Level Timestamp Generation for Automatic Speech Recognition and Translation | May 21, 2025 | Automatic Speech Recognitionautomatic-speech-translation | CodeCode Available | 0 |
| From Weak Labels to Strong Results: Utilizing 5,000 Hours of Noisy Classroom Transcripts with Minimal Accurate Data | May 20, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Impact of Frame Rates on Speech Tokenizer: A Case Study on Mandarin and English | May 20, 2025 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| In-Context Learning Boosts Speech Recognition via Human-like Adaptation to Speakers and Language Varieties | May 20, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Towards Inclusive ASR: Investigating Voice Conversion for Dysarthric Speech Recognition in Low-Resource Languages | May 20, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| PersonaTAB: Predicting Personality Traits using Textual, Acoustic, and Behavioral Cues in Fully-Duplex Speech Dialogs | May 20, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Calm-Whisper: Reduce Whisper Hallucination On Non-Speech By Calming Crazy Heads Down | May 19, 2025 | Automatic Speech RecognitionDecoder | —Unverified | 0 |
| KIT's Offline Speech Translation and Instruction Following Submission for IWSLT 2025 | May 19, 2025 | Automatic Speech RecognitionInstruction Following | —Unverified | 0 |
| Cross-modal Knowledge Transfer Learning as Graph Matching Based on Optimal Transport for ASR | May 19, 2025 | Automatic Speech RecognitionGraph Matching | —Unverified | 0 |
| ASR-FAIRBENCH: Measuring and Benchmarking Equity Across Speech Recognition Systems | May 16, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |