| Listening and Seeing Again: Generative Error Correction for Audio-Visual Speech Recognition | Jan 3, 2025 | Audio-Visual Speech Recognition, Automatic Speech Recognition | Code Available | 0 | 5 |
| LRS3-TED: a large-scale dataset for visual speech recognition | Sep 3, 2018 | Audio-Visual Speech Recognition, speech-recognition | Code Available | 0 | 5 |
| Fusing information streams in end-to-end audio-visual speech recognition | Apr 19, 2021 | Audio-Visual Speech Recognition, Lip Reading | Unverified | 0 | 0 |
| Chinese-LiPS: A Chinese audio-visual speech recognition dataset with Lip-reading and Presentation Slides | Apr 21, 2025 | Audio-Visual Speech Recognition, Automatic Speech Recognition | Unverified | 0 | 0 |
| Audio-visual Recognition of Overlapped speech for the LRS2 dataset | Jan 6, 2020 | Audio-Visual Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 | 0 |
| Enhancing CTC-Based Visual Speech Recognition | Sep 11, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 | 0 |
| End-To-End Visual Speech Recognition With LSTMs | Jan 20, 2017 | Classification, General Classification | Unverified | 0 | 0 |
| End-to-End Visual Speech Recognition for Small-Scale Datasets | Apr 2, 2019 | General Classification, speech-recognition | Unverified | 0 | 0 |
| Building a synchronous corpus of acoustic and 3D facial marker data for adaptive audio-visual speech synthesis | May 1, 2012 | Audio-Visual Speech Recognition, Speech Recognition | Unverified | 0 | 0 |
| A three-dimensional approach to Visual Speech Recognition using Discrete Cosine Transforms | Sep 7, 2016 | speech-recognition, Speech Recognition | Unverified | 0 | 0 |