| Whisper Turns Stronger: Augmenting Wav2Vec 2.0 for Superior ASR in Low-Resource Languages | Dec 31, 2024 | Automatic Speech RecognitionData Augmentation | —Unverified | 0 |
| Fotheidil: an Automatic Transcription System for the Irish Language | Dec 31, 2024 | Action DetectionActivity Detection | —Unverified | 0 |
| DiCoW: Diarization-Conditioned Whisper for Target Speaker Automatic Speech Recognition | Dec 30, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 2 |
| Enhancing Whisper's Accuracy and Speed for Indian Languages through Prompt-Tuning and Tokenization | Dec 27, 2024 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Enhancing Audiovisual Speech Recognition through Bifocal Preference Optimization | Dec 26, 2024 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Zero-resource Speech Translation and Recognition with LLMs | Dec 24, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| UME: Upcycling Mixture-of-Experts for Scalable and Efficient Automatic Speech Recognition | Dec 23, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Enhancing Multilingual ASR for Unseen Languages via Language Embedding Modeling | Dec 21, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Transducer-Llama: Integrating LLMs into Streamable Transducer-based Speech Recognition | Dec 21, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Adapting Whisper for Code-Switching through Encoding Refining and Language-Aware Decoding | Dec 21, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Speech Retrieval-Augmented Generation without Automatic Speech Recognition | Dec 21, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| TouchASP: Elastic Automatic Speech Perception that Everyone Can Touch | Dec 20, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| MathSpeech: Leveraging Small LMs for Accurate Conversion in Mathematical Speech-to-Formula | Dec 20, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| LAMA-UT: Language Agnostic Multilingual ASR through Orthography Unification and Language-Specific Transliteration | Dec 19, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Transcribing and Translating, Fast and Slow: Joint Speech Translation and Recognition | Dec 19, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Streaming Keyword Spotting Boosted by Cross-layer Discrimination Consistency | Dec 17, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 2 |
| Speak & Improve Challenge 2025: Tasks and Baseline Systems | Dec 16, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Speak & Improve Corpus 2025: an L2 English Speech Corpus for Language Assessment and Feedback | Dec 16, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Transliterated Zero-Shot Domain Adaptation for Automatic Speech Recognition | Dec 15, 2024 | Automatic Speech RecognitionDomain Adaptation | —Unverified | 0 |
| Efficient Adaptation of Multilingual Models for Japanese ASR | Dec 14, 2024 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 0 |
| Greek2MathTex: A Greek Speech-to-Text Framework for LaTeX Equations Generation | Dec 11, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Bilevel Joint Unsupervised and Supervised Training for Automatic Speech Recognition | Dec 11, 2024 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Harnessing Transfer Learning from Swahili: Advancing Solutions for Comorian Dialects | Dec 9, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Leveraging Prompt Learning and Pause Encoding for Alzheimer's Disease Detection | Dec 9, 2024 | Alzheimer's Disease DetectionAutomatic Speech Recognition | —Unverified | 0 |
| Effective Text Adaptation for LLM-based ASR through Soft Prompt Fine-Tuning | Dec 9, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |