| LM-assisted keyword biasing with Aho-Corasick algorithm for Transducer-based ASR | Sep 20, 2024 | ARC, Automatic Speech Recognition | Unverified | 0 |
| A Multimodal Dense Retrieval Approach for Speech-Based Open-Domain Question Answering | Sep 20, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Personalized Speech Recognition for Children with Test-Time Adaptation | Sep 19, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Enhancing Synthetic Training Data for Speech Commands: From ASR-Based Filtering to Domain Adaptation in SSL Latent Space | Sep 19, 2024 | Automatic Speech Recognition, Data Augmentation | Unverified | 0 |
| Channel-Aware Domain-Adaptive Generative Adversarial Network for Robust Speech Recognition | Sep 19, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| META-CAT: Speaker-Informed Speech Embeddings via Meta Information Concatenation for Multi-talker ASR | Sep 18, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Large Language Models are Strong Audio-Visual Speech Recognition Learners | Sep 18, 2024 | Audio-Visual Speech Recognition, Automatic Speech Recognition | Code Available | 2 |
| ASR Benchmarking: Need for a More Representative Conversational Dataset | Sep 18, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| WER We Stand: Benchmarking Urdu ASR Models | Sep 17, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Ideal-LLM: Integrating Dual Encoders and Language-Adapted LLM for Multilingual Speech-to-Text | Sep 17, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Chain-of-Thought Prompting for Speech Translation | Sep 17, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Zero Shot Text to Speech Augmentation for Automatic Speech Recognition on Low-Resource Accented Speech Corpora | Sep 17, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| M-BEST-RQ: A Multi-Channel Speech Foundation Model for Smart Glasses | Sep 17, 2024 | Action Detection, Activity Detection | Unverified | 0 |
| A Study on Zero-shot Non-intrusive Speech Assessment using Large Language Models | Sep 16, 2024 | Automatic Speech Recognition, Prompt Engineering | Unverified | 0 |
| SMILE: Speech Meta In-Context Learning for Low-Resource Language Automatic Speech Recognition | Sep 16, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| An Efficient Self-Learning Framework For Interactive Spoken Dialog Systems | Sep 16, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Augmenting Automatic Speech Recognition Models with Disfluency Detection | Sep 16, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Large Language Model Based Generative Error Correction: A Challenge and Baselines for Speech Recognition, Speaker Tagging, and Emotion Recognition | Sep 15, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| ASR Error Correction using Large Language Models | Sep 14, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| CPT-Boosted Wav2vec2.0: Towards Noise Robust Speech Recognition for Classroom Environments | Sep 13, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Learnings from curating a trustworthy, well-annotated, and useful dataset of disordered English speech | Sep 13, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Exploring SSL Discrete Tokens for Multilingual ASR | Sep 13, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Exploring the Impact of Data Quantity on ASR in Extremely Low-resource Languages | Sep 13, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| LA-RAG:Enhancing LLM-based ASR Accuracy with Retrieval-Augmented Generation | Sep 13, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Multi-modal Speech Transformer Decoders: When Do Multiple Modalities Improve Accuracy? | Sep 13, 2024 | Automatic Speech Recognition, Decoder | Unverified | 0 |