| A GEN AI Framework for Medical Note Generation | Sep 27, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Speech-Mamba: Long-Context Speech Recognition with Selective State Spaces Models | Sep 27, 2024 | Automatic Speech Recognition, Mamba | Unverified | 0 |
| Improving Multilingual ASR in the Wild Using Simple N-best Re-ranking | Sep 27, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Unveiling the Role of Pretraining in Direct Speech Translation | Sep 26, 2024 | Automatic Speech Recognition, Decoder | Unverified | 0 |
| Deep CLAS: Deep Contextual Listen, Attend and Spell | Sep 26, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Are Transformers in Pre-trained LM A Good ASR Encoder? An Empirical Study | Sep 26, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| MT2KD: Towards A General-Purpose Encoder for Speech, Speaker, and Audio Events | Sep 25, 2024 | Audio Tagging, Automatic Speech Recognition | Unverified | 0 |
| Weighted Cross-entropy for Low-Resource Languages in Multilingual Speech Recognition | Sep 25, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| Speech Recognition Rescoring with Large Speech-Text Foundation Models | Sep 25, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Semi-Supervised Cognitive State Classification from Speech with Multi-View Pseudo-Labeling | Sep 25, 2024 | Automatic Speech Recognition, Emotion Recognition | Code Available | 0 |