| Large Language Models for Dysfluency Detection in Stuttered Speech | Jun 16, 2024 | Automatic Speech RecognitionLanguage Modeling | —Unverified | 0 |
| Imperceptible Rhythm Backdoor Attacks: Exploring Rhythm Transformation for Embedding Undetectable Vulnerabilities on Speech Recognition | Jun 16, 2024 | Automatic Speech RecognitionData Poisoning | —Unverified | 0 |
| Optimizing Byte-level Representation for End-to-end ASR | Jun 14, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Inclusive ASR for Disfluent Speech: Cascaded Large-Scale Self-Supervised Learning with Targeted Fine-Tuning and Data Augmentation | Jun 14, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Learning Language Structures through Grounding | Jun 14, 2024 | Automatic Speech RecognitionDependency Parsing | —Unverified | 0 |
| ROAR: Reinforcing Original to Augmented Data Ratio Dynamics for Wav2Vec2.0 Based ASR | Jun 14, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| An efficient text augmentation approach for contextualized Mandarin speech recognition | Jun 14, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Multi-Modal Retrieval For Large Language Model Based Speech Recognition | Jun 13, 2024 | Automatic Speech RecognitionLanguage Modeling | —Unverified | 0 |
| Language Complexity and Speech Recognition Accuracy: Orthographic Complexity Hurts, Phonological Complexity Doesn't | Jun 13, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| The Second DISPLACE Challenge : DIarization of SPeaker and LAnguage in Conversational Environments | Jun 13, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |