| Learn2Talk: 3D Talking Face Learns from 2D Talking Face | Apr 19, 2024 | Audio-Visual Speech Recognition, speech-recognition | Unverified | 0 |
| XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception | Mar 21, 2024 | Audio-Visual Speech Recognition, Representation Learning | Unverified | 0 |
| Multilingual Audio-Visual Speech Recognition with Hybrid CTC/RNN-T Fast Conformer | Mar 14, 2024 | Audio-Visual Speech Recognition, Robust Speech Recognition | Unverified | 0 |
| A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition | Mar 7, 2024 | Audio-Visual Speech Recognition, Knowledge Distillation | Code Available | 0 |
| JEP-KD: Joint-Embedding Predictive Architecture Based Knowledge Distillation for Visual Speech Recognition | Mar 4, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Comparison of Conventional Hybrid and CTC/Attention Decoders for Continuous Visual Speech Recognition | Feb 20, 2024 | Decoder, speech-recognition | Unverified | 0 |
| SlideAVSR: A Dataset of Paper Explanation Videos for Audio-Visual Speech Recognition | Jan 18, 2024 | Audio-Visual Speech Recognition, Automatic Speech Recognition | Unverified | 0 |
| MLCA-AVSR: Multi-Layer Cross Attention Fusion based Audio-Visual Speech Recognition | Jan 7, 2024 | Audio-Visual Speech Recognition, Automatic Speech Recognition | Unverified | 0 |
| Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation | Jan 7, 2024 | Audio-Visual Speech Recognition, Automatic Speech Recognition | Code Available | 0 |
| LiteVSR: Efficient Visual Speech Recognition by Learning from Speech Representations of Unlabeled Data | Dec 15, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| The GUA-Speech System Description for CNVSRC Challenge 2023 | Dec 12, 2023 | Decoder, Language Modeling | Unverified | 0 |
| Speaker-Adapted End-to-End Visual Speech Recognition for Continuous Spanish | Nov 21, 2023 | speech-recognition, Speech Recognition | Unverified | 0 |
| Analysis of Visual Features for Continuous Lipreading in Spanish | Nov 21, 2023 | Lipreading, speech-recognition | Unverified | 0 |
| LIP-RTVE: An Audiovisual Database for Continuous Spanish in the Wild | Nov 21, 2023 | Automatic Speech Recognition, speech-recognition | Code Available | 0 |
| End-to-End Lip Reading in Romanian with Cross-Lingual Domain Adaptation and Lateral Inhibition | Oct 7, 2023 | Domain Adaptation, Lip Reading | Unverified | 0 |
| AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition | Sep 29, 2023 | Audio-Visual Speech Recognition, Automatic Speech Recognition | Unverified | 0 |
| The Multimodal Information Based Speech Processing (MISP) 2023 Challenge: Audio-Visual Target Speaker Extraction | Sep 15, 2023 | Audio-Visual Speech Recognition, speech-recognition | Unverified | 0 |
| Another Point of View on Visual Speech Recognition | Aug 20, 2023 | Landmark-based Lipreading, speech-recognition | Unverified | 0 |
| AKVSR: Audio Knowledge Empowered Visual Speech Recognition by Compressing Audio Knowledge of a Pretrained Model | Aug 15, 2023 | Quantization, speech-recognition | Unverified | 0 |
| Lip2Vec: Efficient and Robust Visual Speech Recognition via Latent-to-Latent Visual to Audio Representation Mapping | Aug 11, 2023 | Lip Reading, speech-recognition | Unverified | 0 |
| SparseVSR: Lightweight and Noise Robust Visual Speech Recognition | Jul 10, 2023 | speech-recognition, Speech Recognition | Unverified | 0 |
| Automated Speaker Independent Visual Speech Recognition: A Comprehensive Survey | Jun 14, 2023 | speech-recognition, Speech Recognition | Unverified | 0 |
| Improving the Gap in Visual Speech Recognition Between Normal and Silent Speech Based on Metric Learning | May 23, 2023 | Metric Learning, speech-recognition | Unverified | 0 |
| Multi-Temporal Lip-Audio Memory for Visual Speech Recognition | May 8, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| Deep Learning-based Spatio Temporal Facial Feature Visual Speech Recognition | Apr 30, 2023 | Deep Learning, Face Recognition | Unverified | 0 |