| Word Level Timestamp Generation for Automatic Speech Recognition and Translation | May 21, 2025 | Automatic Speech Recognitionautomatic-speech-translation | —Unverified | 0 |
| From Weak Labels to Strong Results: Utilizing 5,000 Hours of Noisy Classroom Transcripts with Minimal Accurate Data | May 20, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Impact of Frame Rates on Speech Tokenizer: A Case Study on Mandarin and English | May 20, 2025 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| In-Context Learning Boosts Speech Recognition via Human-like Adaptation to Speakers and Language Varieties | May 20, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Towards Inclusive ASR: Investigating Voice Conversion for Dysarthric Speech Recognition in Low-Resource Languages | May 20, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| PersonaTAB: Predicting Personality Traits using Textual, Acoustic, and Behavioral Cues in Fully-Duplex Speech Dialogs | May 20, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Calm-Whisper: Reduce Whisper Hallucination On Non-Speech By Calming Crazy Heads Down | May 19, 2025 | Automatic Speech RecognitionDecoder | —Unverified | 0 |
| KIT's Offline Speech Translation and Instruction Following Submission for IWSLT 2025 | May 19, 2025 | Automatic Speech RecognitionInstruction Following | —Unverified | 0 |
| Cross-modal Knowledge Transfer Learning as Graph Matching Based on Optimal Transport for ASR | May 19, 2025 | Automatic Speech RecognitionGraph Matching | —Unverified | 0 |
| ASR-FAIRBENCH: Measuring and Benchmarking Equity Across Speech Recognition Systems | May 16, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |