| SimClass: A Classroom Speech Dataset Generated via Game Engine Simulation For Automatic Speech Recognition Research | Jun 10, 2025 | Automatic Speech RecognitionData Augmentation | —Unverified | 0 |
| Robust Speech Recognition with Schrödinger Bridge-Based Speech Enhancement | May 7, 2025 | Robust Speech RecognitionSpeech Enhancement | —Unverified | 0 |
| Dysarthria Normalization via Local Lie Group Transformations for Robust ASR | Apr 16, 2025 | Robust Speech Recognitionspeech-recognition | CodeCode Available | 0 |
| MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens | Mar 14, 2025 | Audio-Visual Speech RecognitionComputational Efficiency | CodeCode Available | 1 |
| MoHAVE: Mixture of Hierarchical Audio-Visual Experts for Robust Speech Recognition | Feb 11, 2025 | Audio-Visual Speech RecognitionComputational Efficiency | —Unverified | 0 |
| mWhisper-Flamingo for Multilingual Audio-Visual Noise-Robust Speech Recognition | Feb 3, 2025 | Audio-Visual Speech RecognitionDecoder | CodeCode Available | 3 |
| Data-Driven Mispronunciation Pattern Discovery for Robust Speech Recognition | Feb 1, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Privacy-Preserving Edge Speech Understanding with Tiny Foundation Models | Jan 29, 2025 | Privacy PreservingRobust Speech Recognition | —Unverified | 0 |
| Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition | Sep 26, 2024 | DecoderRobust Speech Recognition | —Unverified | 0 |
| Channel-Aware Domain-Adaptive Generative Adversarial Network for Robust Speech Recognition | Sep 19, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |