| SimClass: A Classroom Speech Dataset Generated via Game Engine Simulation For Automatic Speech Recognition Research | Jun 10, 2025 | Automatic Speech RecognitionData Augmentation | —Unverified | 0 |
| Robust Speech Recognition with Schrödinger Bridge-Based Speech Enhancement | May 7, 2025 | Robust Speech RecognitionSpeech Enhancement | —Unverified | 0 |
| Dysarthria Normalization via Local Lie Group Transformations for Robust ASR | Apr 16, 2025 | Robust Speech Recognitionspeech-recognition | CodeCode Available | 0 |
| MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens | Mar 14, 2025 | Audio-Visual Speech RecognitionComputational Efficiency | CodeCode Available | 1 |
| MoHAVE: Mixture of Hierarchical Audio-Visual Experts for Robust Speech Recognition | Feb 11, 2025 | Audio-Visual Speech RecognitionComputational Efficiency | —Unverified | 0 |
| mWhisper-Flamingo for Multilingual Audio-Visual Noise-Robust Speech Recognition | Feb 3, 2025 | Audio-Visual Speech RecognitionDecoder | CodeCode Available | 3 |
| Data-Driven Mispronunciation Pattern Discovery for Robust Speech Recognition | Feb 1, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Privacy-Preserving Edge Speech Understanding with Tiny Foundation Models | Jan 29, 2025 | Privacy PreservingRobust Speech Recognition | —Unverified | 0 |
| Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition | Sep 26, 2024 | DecoderRobust Speech Recognition | —Unverified | 0 |
| Channel-Aware Domain-Adaptive Generative Adversarial Network for Robust Speech Recognition | Sep 19, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| Robust Audiovisual Speech Recognition Models with Mixture-of-Experts | Sep 19, 2024 | Mixture-of-ExpertsRobust Speech Recognition | —Unverified | 0 |
| CPT-Boosted Wav2vec2.0: Towards Noise Robust Speech Recognition for Classroom Environments | Sep 13, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception | Mar 21, 2024 | Audio-Visual Speech RecognitionRepresentation Learning | —Unverified | 0 |
| Multilingual Audio-Visual Speech Recognition with Hybrid CTC/RNN-T Fast Conformer | Mar 14, 2024 | Audio-Visual Speech RecognitionRobust Speech Recognition | —Unverified | 0 |
| Speech Robust Bench: A Robustness Benchmark For Speech Recognition | Mar 8, 2024 | Adversarial RobustnessAutomatic Speech Recognition | CodeCode Available | 1 |
| Large Language Models are Efficient Learners of Noise-Robust Speech Recognition | Jan 19, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 2 |
| KinSPEAK: Improving speech recognition for Kinyarwanda via semi-supervised learning methods | Aug 23, 2023 | Robust Speech Recognitionspeech-recognition | —Unverified | 0 |
| LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT | Jun 29, 2023 | Automatic Lyrics TranscriptionLanguage Modeling | CodeCode Available | 1 |
| The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios | Jun 23, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Statistical Beamformer Exploiting Non-stationarity and Sparsity with Spatially Constrained ICA for Robust Speech Recognition | Jun 13, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| RescueSpeech: A German Corpus for Speech Recognition in Search and Rescue Domain | Jun 6, 2023 | Decision MakingRobust Speech Recognition | —Unverified | 0 |
| Incorporating L2 Phonemes Using Articulatory Features for Robust Speech Recognition | Jun 5, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| AVFormer: Injecting Vision into Frozen Speech Models for Zero-Shot AV-ASR | Mar 29, 2023 | Automatic Speech RecognitionDomain Adaptation | —Unverified | 0 |
| MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation | Mar 1, 2023 | Audio-Visual Speech RecognitionRobust Speech Recognition | CodeCode Available | 2 |
| Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition | Feb 22, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |