| Robust Speech Recognition via Large-Scale Weak Supervision | Dec 6, 2022 | Robust Speech Recognitionspeech-recognition | CodeCode Available | 8 |
| mWhisper-Flamingo for Multilingual Audio-Visual Noise-Robust Speech Recognition | Feb 3, 2025 | Audio-Visual Speech RecognitionDecoder | CodeCode Available | 3 |
| Large Language Models are Efficient Learners of Noise-Robust Speech Recognition | Jan 19, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 2 |
| MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation | Mar 1, 2023 | Audio-Visual Speech RecognitionRobust Speech Recognition | CodeCode Available | 2 |
| Audio-Visual Efficient Conformer for Robust Speech Recognition | Jan 4, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| An Investigation of End-to-End Models for Robust Speech Recognition | Feb 11, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT | Jun 29, 2023 | Automatic Lyrics TranscriptionLanguage Modeling | CodeCode Available | 1 |
| Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition | Feb 22, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition | Oct 11, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| CCC-wav2vec 2.0: Clustering aided Cross Contrastive Self-supervised learning of speech representations | Oct 5, 2022 | Automatic Speech Recognition (ASR)Clustering | CodeCode Available | 1 |