| Speech Recognition Rescoring with Large Speech-Text Foundation Models | Sep 25, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| How to Connect Speech Foundation Models and Large Language Models? What Matters and What Does Not | Sep 25, 2024 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Weighted Cross-entropy for Low-Resource Languages in Multilingual Speech Recognition | Sep 25, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| MT2KD: Towards A General-Purpose Encoder for Speech, Speaker, and Audio Events | Sep 25, 2024 | Audio TaggingAutomatic Speech Recognition | —Unverified | 0 |
| Semi-Supervised Cognitive State Classification from Speech with Multi-View Pseudo-Labeling | Sep 25, 2024 | Automatic Speech RecognitionEmotion Recognition | CodeCode Available | 0 |
| Revisiting Acoustic Features for Robust ASR | Sep 24, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Bridging Speech and Text: Enhancing ASR with Pinyin-to-Character Pre-training in LLMs | Sep 24, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Boosting Code-Switching ASR with Mixture of Experts Enhanced Speech-Conditioned LLM | Sep 24, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Spelling Correction through Rewriting of Non-Autoregressive ASR Lattices | Sep 24, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Revise, Reason, and Recognize: LLM-Based Emotion Recognition via Emotion-Specific Prompts and ASR Error Correction | Sep 23, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |