| MSR-86K: An Evolving, Multilingual Corpus with 86,300 Hours of Transcribed Audio for Speech Recognition Research | Jun 26, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Dynamic Data Pruning for Automatic Speech Recognition | Jun 26, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| ArzEn-LLM: Code-Switched Egyptian Arabic-English Translation and Speech Recognition Using LLMs | Jun 26, 2024 | ArzEn Code-switched Translation to ara, ArzEn Code-switched Translation to eng | Code Available | 1 |
| FASA: a Flexible and Automatic Speech Aligner for Extracting High-quality Aligned Children Speech Data | Jun 25, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNet | Jun 25, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Sequential Editing for Lifelong Training of Speech Recognition Models | Jun 25, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model | Jun 25, 2024 | Automatic Lyrics Transcription, Automatic Speech Recognition | Code Available | 1 |
| Blending LLMs into Cascaded Speech Translation: KIT's Offline Speech Translation System for IWSLT 2024 | Jun 24, 2024 | Action Detection, Activity Detection | Unverified | 0 |
| Contextualized End-to-end Automatic Speech Recognition with Intermediate Biasing Loss | Jun 23, 2024 | Automatic Speech Recognition, speech-recognition | Unverified | 0 |
| Decoder-only Architecture for Streaming End-to-end Speech Recognition | Jun 23, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |