| Exploring Gender Disparities in Automatic Speech Recognition Technology | Feb 25, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Balancing Speech Understanding and Generation Using Continual Pre-training for Codec-based Speech LLM | Feb 24, 2025 | Automatic Speech RecognitionLanguage Modeling | —Unverified | 0 |
| Low-Rank and Sparse Model Merging for Multi-Lingual Speech Recognition and Translation | Feb 24, 2025 | Automatic Speech RecognitionDiversity | —Unverified | 0 |
| Understanding Zero-shot Rare Word Recognition Improvements Through LLM Integration | Feb 22, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| The Esethu Framework: Reimagining Sustainable Dataset Governance and Curation for Low-Resource Languages | Feb 21, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Enhancing Speech Large Language Models with Prompt-Aware Mixture of Audio Encoders | Feb 21, 2025 | Audio captioningAutomatic Speech Recognition | —Unverified | 0 |
| WavRAG: Audio-Integrated Retrieval Augmented Generation for Spoken Dialogue Models | Feb 20, 2025 | Automatic Speech RecognitionRAG | —Unverified | 0 |
| Measuring the Effect of Transcription Noise on Downstream Language Understanding Tasks | Feb 19, 2025 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 0 |
| Adopting Whisper for Confidence Estimation | Feb 19, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Lost in Transcription, Found in Distribution Shift: Demystifying Hallucination in Speech Foundation Models | Feb 18, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |