| Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition | Feb 22, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| MMS-LLaMA: Efficient LLM-based Audio-Visual Speech Recognition with Minimal Multimodal Speech Tokens | Mar 14, 2025 | Audio-Visual Speech Recognition, Computational Efficiency | Code Available | 1 | 5 |
| LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT | Jun 29, 2023 | Automatic Lyrics Transcription, Language Modeling | Code Available | 1 | 5 |
| Multi-task self-supervised learning for Robust Speech Recognition | Jan 25, 2020 | Robust Speech Recognition, Self-Supervised Learning | Code Available | 1 | 5 |
| Speech Robust Bench: A Robustness Benchmark For Speech Recognition | Mar 8, 2024 | Adversarial Robustness, Automatic Speech Recognition | Code Available | 1 | 5 |
| Learning Waveform-Based Acoustic Models using Deep Variational Convolutional Neural Networks | Jun 23, 2019 | Bayesian Inference, Robust Speech Recognition | Code Available | 0 | 5 |
| Channel-Aware Domain-Adaptive Generative Adversarial Network for Robust Speech Recognition | Sep 19, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 | 5 |
| Investigating Generative Adversarial Networks based Speech Dereverberation for Robust Speech Recognition | Mar 27, 2018 | Robust Speech Recognition, Speech Dereverberation | Code Available | 0 | 5 |
| ESPnet-SE++: Speech Enhancement for Robust Speech Recognition, Translation, and Understanding | Jul 19, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 | 5 |
| Domain Adaptation Using Class Similarity for Robust Speech Recognition | Nov 5, 2020 | Domain Adaptation, Robust Speech Recognition | Code Available | 0 | 5 |