| Multi-modal Dense Video Captioning | Mar 17, 2020 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models | May 9, 2024 | Adversarial Attack, Automatic Speech Recognition | Code Available | 1 |
| A Toolbox for Construction and Analysis of Speech Datasets | Apr 11, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Neural Predictor for Black-Box Adversarial Attacks on Speech Recognition | Mar 18, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Attentive Sequence-to-Sequence Learning for Diacritic Restoration of Yorùbá Language Text | Apr 3, 2018 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation Models | Feb 9, 2025 | Audio-Visual Speech Recognition, Automatic Speech Recognition | Code Available | 1 |
| Open Source Automatic Speech Recognition for German | Jul 26, 2018 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Attention-based Audio-Visual Fusion for Robust Automatic Speech Recognition | Sep 5, 2018 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| A transfer learning based approach for pronunciation scoring | Nov 1, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Attention-based Contextual Language Model Adaptation for Speech Recognition | Jun 2, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |