| XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception | Mar 21, 2024 | Audio-Visual Speech Recognition, Representation Learning | —Unverified | 0 |
| SlideAVSR: A Dataset of Paper Explanation Videos for Audio-Visual Speech Recognition | Jan 18, 2024 | Audio-Visual Speech Recognition, Automatic Speech Recognition | —Unverified | 0 |
| SparseVSR: Lightweight and Noise Robust Visual Speech Recognition | Jul 10, 2023 | speech-recognition, Speech Recognition | —Unverified | 0 |
| Spatio-Temporal Attention Mechanism and Knowledge Distillation for Lip Reading | Aug 7, 2021 | Audio-Visual Speech Recognition, Knowledge Distillation | —Unverified | 0 |
| Speaker-Adapted End-to-End Visual Speech Recognition for Continuous Spanish | Nov 21, 2023 | speech-recognition, Speech Recognition | —Unverified | 0 |
| Streaming Audio-Visual Speech Recognition with Alignment Regularization | Nov 3, 2022 | Audio-Visual Speech Recognition, Automatic Speech Recognition | —Unverified | 0 |
| Sub-word Level Lip Reading With Visual Attention | Oct 14, 2021 | Audio-Visual Active Speaker Detection, Automatic Speech Recognition | —Unverified | 0 |
| SUTAV: A Turkish Audio-Visual Database | May 1, 2012 | Audio-Visual Speech Recognition, Person Identification | —Unverified | 0 |
| SwinLip: An Efficient Visual Speech Encoder for Lip Reading Using Swin Transformer | May 7, 2025 | Audio-Visual Speech Recognition, Lip Reading | —Unverified | 0 |
| SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision | Mar 30, 2023 | Lip Reading, speech-recognition | —Unverified | 0 |