| Robust Speech Recognition via Large-Scale Weak Supervision | Dec 6, 2022 | Robust Speech Recognitionspeech-recognition | CodeCode Available | 8 | 5 |
| mWhisper-Flamingo for Multilingual Audio-Visual Noise-Robust Speech Recognition | Feb 3, 2025 | Audio-Visual Speech RecognitionDecoder | CodeCode Available | 3 | 5 |
| Large Language Models are Efficient Learners of Noise-Robust Speech Recognition | Jan 19, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 2 | 5 |
| MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation | Mar 1, 2023 | Audio-Visual Speech RecognitionRobust Speech Recognition | CodeCode Available | 2 | 5 |
| An Investigation of End-to-End Models for Robust Speech Recognition | Feb 11, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 | 5 |
| Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition | Oct 11, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 | 5 |
| Multi-task self-supervised learning for Robust Speech Recognition | Jan 25, 2020 | Robust Speech RecognitionSelf-Supervised Learning | CodeCode Available | 1 | 5 |
| Speech Robust Bench: A Robustness Benchmark For Speech Recognition | Mar 8, 2024 | Adversarial RobustnessAutomatic Speech Recognition | CodeCode Available | 1 | 5 |
| Audio-Visual Efficient Conformer for Robust Speech Recognition | Jan 4, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 | 5 |
| Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition | Feb 22, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 | 5 |
| LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT | Jun 29, 2023 | Automatic Lyrics TranscriptionLanguage Modeling | CodeCode Available | 1 | 5 |
| MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens | Mar 14, 2025 | Audio-Visual Speech RecognitionComputational Efficiency | CodeCode Available | 1 | 5 |
| Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition | Mar 28, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 | 5 |
| CCC-wav2vec 2.0: Clustering aided Cross Contrastive Self-supervised learning of speech representations | Oct 5, 2022 | Automatic Speech Recognition (ASR)Clustering | CodeCode Available | 1 | 5 |
| DENT-DDSP: Data-efficient noisy speech generator using differentiable digital signal processors for explicit distortion modelling and noise-robust speech recognition | Aug 1, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 | 5 |
| Dysarthria Normalization via Local Lie Group Transformations for Robust ASR | Apr 16, 2025 | Robust Speech Recognitionspeech-recognition | CodeCode Available | 0 | 5 |
| Learning Waveform-Based Acoustic Models using Deep Variational Convolutional Neural Networks | Jun 23, 2019 | Bayesian InferenceRobust Speech Recognition | CodeCode Available | 0 | 5 |
| Investigating Generative Adversarial Networks based Speech Dereverberation for Robust Speech Recognition | Mar 27, 2018 | Robust Speech RecognitionSpeech Dereverberation | CodeCode Available | 0 | 5 |
| Sequential Randomized Smoothing for Adversarially Robust Speech Recognition | Nov 5, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding | Jul 19, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Channel-Aware Domain-Adaptive Generative Adversarial Network for Robust Speech Recognition | Sep 19, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Speech-enhanced and Noise-aware Networks for Robust Speech Recognition | Mar 25, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Domain Adaptation Using Class Similarity for Robust Speech Recognition | Nov 5, 2020 | Domain AdaptationRobust Speech Recognition | CodeCode Available | 0 | 5 |
| Scalable Factorized Hierarchical Variational Autoencoder Training | Apr 9, 2018 | DisentanglementHyperparameter Optimization | CodeCode Available | 0 | 5 |
| Unsupervised Speech Domain Adaptation Based on Disentangled Representation Learning for Robust Speech Recognition | Apr 12, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Very Deep Convolutional Neural Networks for Robust Speech Recognition | Oct 2, 2016 | Robust Speech Recognitionspeech-recognition | CodeCode Available | 0 | 5 |
| Incorporating L2 Phonemes Using Articulatory Features for Robust Speech Recognition | Jun 5, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| 調變頻譜分解技術於強健語音辨識之研究 (Investigating Modulation Spectrum Factorization Techniques for Robust Speech Recognition) [In Chinese] | Dec 1, 2015 | Robust Speech Recognitionspeech-recognition | —Unverified | 0 | 0 |
| Investigation of Densely Connected Convolutional Networks with Domain Adversarial Learning for Noise Robust Speech Recognition | Dec 19, 2021 | Robust Speech Recognitionspeech-recognition | —Unverified | 0 | 0 |
| KinSPEAK: Improving speech recognition for Kinyarwanda via semi-supervised learning methods | Aug 23, 2023 | Robust Speech Recognitionspeech-recognition | —Unverified | 0 | 0 |
| Learning Noise-Invariant Representations for Robust Speech Recognition | Jul 17, 2018 | Data AugmentationRepresentation Learning | —Unverified | 0 | 0 |
| Learning Noise-Invariant Representations for Robust Speech Recognition | Oct 2, 2018 | Data AugmentationRepresentation Learning | —Unverified | 0 | 0 |
| Modality Attention for End-to-End Audio-visual Speech Recognition | Nov 13, 2018 | Audio-Visual Speech RecognitionRobust Speech Recognition | —Unverified | 0 | 0 |
| Modified SPLICE and its Extension to Non-Stereo Data for Noise Robust Speech Recognition | Jul 15, 2013 | Robust Speech Recognitionspeech-recognition | —Unverified | 0 | 0 |
| MoHAVE: Mixture of Hierarchical Audio-Visual Experts for Robust Speech Recognition | Feb 11, 2025 | Audio-Visual Speech RecognitionComputational Efficiency | —Unverified | 0 | 0 |
| Multilingual Audio-Visual Speech Recognition with Hybrid CTC/RNN-T Fast Conformer | Mar 14, 2024 | Audio-Visual Speech RecognitionRobust Speech Recognition | —Unverified | 0 | 0 |
| Multiple Confidence Gates For Joint Training Of SE And ASR | Apr 1, 2022 | Robust Speech RecognitionSpeech Enhancement | —Unverified | 0 | 0 |
| Multi-scale Octave Convolutions for Robust Speech Recognition | Oct 31, 2019 | Computational EfficiencyRobust Speech Recognition | —Unverified | 0 | 0 |
| Multi-Staged Cross-Lingual Acoustic Model Adaption for Robust Speech Recognition in Real-World Applications - A Case Study on German Oral History Interviews | May 1, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Noise-robust Speech Recognition with 10 Minutes Unparalleled In-domain Data | Mar 29, 2022 | Generative Adversarial NetworkRobust Speech Recognition | —Unverified | 0 | 0 |
| On combining features for single-channel robust speech recognition in reverberant environments | Jun 17, 2019 | Robust Speech Recognitionspeech-recognition | —Unverified | 0 | 0 |
| On monoaural speech enhancement for automatic recognition of real noisy speech using mixture invariant training | May 3, 2022 | Robust Speech RecognitionSpeech Enhancement | —Unverified | 0 | 0 |
| On the Use of Different Feature Extraction Methods for Linear and Non Linear kernels | Jun 27, 2014 | Robust Speech RecognitionSpeaker Identification | —Unverified | 0 | 0 |
| Optimized Power Normalized Cepstral Coefficients towards Robust Deep Speaker Verification | Sep 24, 2021 | Robust Speech RecognitionSpeaker Verification | —Unverified | 0 | 0 |
| Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition | Sep 26, 2024 | DecoderRobust Speech Recognition | —Unverified | 0 | 0 |
| Phone Based Keyword Spotting for Transcribing Very Low Resource Languages | Dec 1, 2021 | Dynamic Time WarpingKeyword Spotting | —Unverified | 0 | 0 |
| pMCT: Patched Multi-Condition Training for Robust Speech Recognition | Jul 11, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Privacy-Preserving Edge Speech Understanding with Tiny Foundation Models | Jan 29, 2025 | Privacy PreservingRobust Speech Recognition | —Unverified | 0 | 0 |
| Recurrent Models for Auditory Attention in Multi-Microphone Distance Speech Recognition | Nov 19, 2015 | Robust Speech RecognitionSpeech Enhancement | —Unverified | 0 | 0 |
| Reinforcement Learning Based Speech Enhancement for Robust Speech Recognition | Nov 10, 2018 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |