| Zero Shot Text to Speech Augmentation for Automatic Speech Recognition on Low-Resource Accented Speech Corpora | Sep 17, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses | Sep 17, 2024 | Action DetectionActivity Detection | —Unverified | 0 |
| Chain-of-Thought Prompting for Speech Translation | Sep 17, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| A Study on Zero-shot Non-intrusive Speech Assessment using Large Language Models | Sep 16, 2024 | Automatic Speech RecognitionPrompt Engineering | —Unverified | 0 |
| SMILE: Speech Meta In-Context Learning for Low-Resource Language Automatic Speech Recognition | Sep 16, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| An Efficient Self-Learning Framework For Interactive Spoken Dialog Systems | Sep 16, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Augmenting Automatic Speech Recognition Models with Disfluency Detection | Sep 16, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition | Sep 15, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| ASR Error Correction using Large Language Models | Sep 14, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| CPT-Boosted Wav2vec2.0: Towards Noise Robust Speech Recognition for Classroom Environments | Sep 13, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |