| The GUA-Speech System Description for CNVSRC Challenge 2023 | Dec 12, 2023 | DecoderLanguage Modeling | —Unverified | 0 | 0 |
| The Multimodal Information Based Speech Processing (MISP) 2023 Challenge: Audio-Visual Target Speaker Extraction | Sep 15, 2023 | Audio-Visual Speech Recognitionspeech-recognition | —Unverified | 0 | 0 |
| The Multimodal Information Based Speech Processing (MISP) 2025 Challenge: Audio-Visual Diarization and Recognition | May 20, 2025 | Audio-Visual Speech Recognitionspeaker-diarization | —Unverified | 0 | 0 |
| A three-dimensional approach to Visual Speech Recognition using Discrete Cosine Transforms | Sep 7, 2016 | speech-recognitionSpeech Recognition | —Unverified | 0 | 0 |
| The NPU-ASLP System for Audio-Visual Speech Recognition in MISP 2022 Challenge | Mar 11, 2023 | Audio-Visual Speech Recognitionspeech-recognition | —Unverified | 0 | 0 |
| Towards Estimating the Upper Bound of Visual-Speech Recognition: The Visual Lip-Reading Feasibility Database | Apr 26, 2017 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Towards Lipreading Sentences with Active Appearance Models | May 29, 2018 | Audio-Visual Speech RecognitionLipreading | —Unverified | 0 | 0 |
| 3D Feature Pyramid Attention Module for Robust Visual Speech Recognition | Oct 15, 2018 | LipreadingSentence | —Unverified | 0 | 0 |
| Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition for Single and Multi-Person Video | Jan 25, 2022 | Audio-Visual Speech RecognitionAutomatic Speech Recognition | —Unverified | 0 | 0 |
| Uncovering the Visual Contribution in Audio-Visual Speech Recognition | Dec 22, 2024 | Audio-Visual Speech RecognitionInformativeness | —Unverified | 0 | 0 |
| VATLM: Visual-Audio-Text Pre-Training with Unified Masked Prediction for Speech Representation Learning | Nov 21, 2022 | Audio-Visual Speech RecognitionLanguage Modelling | —Unverified | 0 | 0 |
| ViCocktail: Automated Multi-Modal Data Collection for Vietnamese Audio-Visual Speech Recognition | Jun 5, 2025 | Audio-Visual Speech Recognitionspeech-recognition | —Unverified | 0 | 0 |
| Video-Based Action Recognition Using Rate-Invariant Analysis of Covariance Trajectories | Mar 23, 2015 | Action RecognitionGeneral Classification | —Unverified | 0 | 0 |
| Visual-Aware Speech Recognition for Noisy Scenarios | Apr 9, 2025 | Audio-Visual Speech RecognitionAutomatic Speech Recognition | —Unverified | 0 | 0 |
| ASR is all you need: cross-modal distillation for lip reading | Nov 28, 2019 | AllAutomatic Speech Recognition | —Unverified | 0 | 0 |
| Visual-Only Recognition of Normal, Whispered and Silent Speech | Feb 18, 2018 | Silent Speech Recognitionspeech-recognition | —Unverified | 0 | 0 |
| VisualSpeaker: Visually-Guided 3D Avatar Lip Synthesis | Jul 8, 2025 | Automatic Speech RecognitionLip Reading | —Unverified | 0 | 0 |
| Visual Speech Recognition | Sep 3, 2014 | Audio-Visual Speech RecognitionLip Reading | —Unverified | 0 | 0 |
| Visual speech recognition: aligning terminologies for better understanding | Oct 3, 2017 | Lipreadingspeech-recognition | —Unverified | 0 | 0 |
| Another Point of View on Visual Speech Recognition | Aug 20, 2023 | Landmark-based Lipreadingspeech-recognition | —Unverified | 0 | 0 |
| Analysis of Visual Features for Continuous Lipreading in Spanish | Nov 21, 2023 | Lipreadingspeech-recognition | —Unverified | 0 | 0 |
| Visual Speech Recognition in a Driver Assistance System | Aug 29, 2022 | Data AugmentationLipreading | —Unverified | 0 | 0 |
| Visual Speech Recognition Using PCA Networks and LSTMs in a Tandem GMM-HMM System | Oct 19, 2017 | Sentencespeech-recognition | —Unverified | 0 | 0 |
| Detecting Adversarial Attacks On Audiovisual Speech Recognition | Dec 18, 2019 | Audio-Visual Speech Recognitionspeech-recognition | —Unverified | 0 | 0 |
| End-to-End Lip Reading in Romanian with Cross-Lingual Domain Adaptation and Lateral Inhibition | Oct 7, 2023 | Domain AdaptationLip Reading | —Unverified | 0 | 0 |
| End-to-End Visual Speech Recognition for Small-Scale Datasets | Apr 2, 2019 | General Classificationspeech-recognition | —Unverified | 0 | 0 |
| End-To-End Visual Speech Recognition With LSTMs | Jan 20, 2017 | ClassificationGeneral Classification | —Unverified | 0 | 0 |
| Enhancing CTC-Based Visual Speech Recognition | Sep 11, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| Visual Words for Automatic Lip-Reading | Sep 17, 2014 | Lip Readingspeech-recognition | —Unverified | 0 | 0 |
| Fusing information streams in end-to-end audio-visual speech recognition | Apr 19, 2021 | Audio-Visual Speech RecognitionLip Reading | —Unverified | 0 | 0 |
| A Multi-Purpose Audio-Visual Corpus for Multi-Modal Persian Speech Recognition: the Arman-AV Dataset | Jan 21, 2023 | Audio-Visual Speech RecognitionAutomatic Speech Recognition | —Unverified | 0 | 0 |
| Deep Visual Forced Alignment: Learning to Align Transcription with Talking Face Video | Feb 27, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |