| Automatic Text Pronunciation Correlation Generation and Application for Contextual Biasing | Jan 1, 2025 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Breaking Through the Spike: Spike Window Decoding for Accelerated and Precise Automatic Speech Recognition | Jan 1, 2025 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale | Jan 1, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Whisper Turns Stronger: Augmenting Wav2Vec 2.0 for Superior ASR in Low-Resource Languages | Dec 31, 2024 | Automatic Speech RecognitionData Augmentation | —Unverified | 0 |
| Fotheidil: an Automatic Transcription System for the Irish Language | Dec 31, 2024 | Action DetectionActivity Detection | —Unverified | 0 |
| Enhancing Whisper's Accuracy and Speed for Indian Languages through Prompt-Tuning and Tokenization | Dec 27, 2024 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Enhancing Audiovisual Speech Recognition through Bifocal Preference Optimization | Dec 26, 2024 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Zero-resource Speech Translation and Recognition with LLMs | Dec 24, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| UME: Upcycling Mixture-of-Experts for Scalable and Efficient Automatic Speech Recognition | Dec 23, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Enhancing Multilingual ASR for Unseen Languages via Language Embedding Modeling | Dec 21, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |