| META-CAT: Speaker-Informed Speech Embeddings via Meta Information Concatenation for Multi-talker ASR | Sep 18, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Chain-of-Thought Prompting for Speech Translation | Sep 17, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Zero Shot Text to Speech Augmentation for Automatic Speech Recognition on Low-Resource Accented Speech Corpora | Sep 17, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses | Sep 17, 2024 | Action DetectionActivity Detection | —Unverified | 0 |
| WER We Stand: Benchmarking Urdu ASR Models | Sep 17, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Ideal-LLM: Integrating Dual Encoders and Language-Adapted LLM for Multilingual Speech-to-Text | Sep 17, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| An Efficient Self-Learning Framework For Interactive Spoken Dialog Systems | Sep 16, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Augmenting Automatic Speech Recognition Models with Disfluency Detection | Sep 16, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| A Study on Zero-shot Non-intrusive Speech Assessment using Large Language Models | Sep 16, 2024 | Automatic Speech RecognitionPrompt Engineering | —Unverified | 0 |
| SMILE: Speech Meta In-Context Learning for Low-Resource Language Automatic Speech Recognition | Sep 16, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |