| Advancing Singlish Understanding: Bridging the Gap with Datasets and Multimodal Models | Jan 2, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale | Jan 1, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Breaking Through the Spike: Spike Window Decoding for Accelerated and Precise Automatic Speech Recognition | Jan 1, 2025 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |
| Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation | Jan 1, 2025 | Automatic Speech Recognition, Decoder | Code Available | 1 |
| Automatic Text Pronunciation Correlation Generation and Application for Contextual Biasing | Jan 1, 2025 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |
| Fotheidil: an Automatic Transcription System for the Irish Language | Dec 31, 2024 | Action Detection, Activity Detection | Unverified | 0 |
| Whisper Turns Stronger: Augmenting Wav2Vec 2.0 for Superior ASR in Low-Resource Languages | Dec 31, 2024 | Automatic Speech Recognition, Data Augmentation | Unverified | 0 |
| DiCoW: Diarization-Conditioned Whisper for Target Speaker Automatic Speech Recognition | Dec 30, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 2 |
| Enhancing Whisper's Accuracy and Speed for Indian Languages through Prompt-Tuning and Tokenization | Dec 27, 2024 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |
| Enhancing Audiovisual Speech Recognition through Bifocal Preference Optimization | Dec 26, 2024 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |