| A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition | Mar 7, 2024 | Audio-Visual Speech RecognitionKnowledge Distillation | CodeCode Available | 0 | 5 |
| Deep word embeddings for visual speech recognition | Oct 30, 2017 | Lipreadingspeech-recognition | CodeCode Available | 0 | 5 |
| SynesLM: A Unified Approach for Audio-visual Speech Recognition and Translation via Language Model and Synthetic Data | Aug 1, 2024 | Audio-Visual Speech RecognitionAutomatic Speech Recognition | CodeCode Available | 0 | 5 |
| LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild | Oct 16, 2018 | LipreadingLip Reading | CodeCode Available | 0 | 5 |
| Listening and Seeing Again: Generative Error Correction for Audio-Visual Speech Recognition | Jan 3, 2025 | Audio-Visual Speech RecognitionAutomatic Speech Recognition | CodeCode Available | 0 | 5 |
| LIP-RTVE: An Audiovisual Database for Continuous Spanish in the Wild | Nov 21, 2023 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 0 | 5 |
| LRS3-TED: a large-scale dataset for visual speech recognition | Sep 3, 2018 | Audio-Visual Speech Recognitionspeech-recognition | CodeCode Available | 0 | 5 |
| The NPU-ASLP System Description for Visual Speech Recognition in CNVSRC 2024 | Aug 5, 2024 | Decoderspeech-recognition | CodeCode Available | 0 | 5 |
| Combining Residual Networks with LSTMs for Lipreading | Mar 12, 2017 | LipreadingLip Reading | CodeCode Available | 0 | 5 |
| Audio-Visual Speech Recognition based on Regulated Transformer and Spatio-Temporal Fusion Strategy for Driver Assistive Systems | May 9, 2024 | Audio-Visual Speech RecognitionLipreading | CodeCode Available | 0 | 5 |