SOTAVerified

Visual Speech Recognition

Papers

Showing 91100 of 182 papers

TitleStatusHype
JEP-KD: Joint-Embedding Predictive Architecture Based Knowledge Distillation for Visual Speech Recognition0
3D Feature Pyramid Attention Module for Robust Visual Speech Recognition0
Adapter-Based Multi-Agent AVSR Extension for Pre-Trained ASR Models0
Adaptive Audio-Visual Speech Recognition via Matryoshka-Based Multimodal LLMs0
Advances and Challenges in Deep Lip Reading0
AKVSR: Audio Knowledge Empowered Visual Speech Recognition by Compressing Audio Knowledge of a Pretrained Model0
A Multi-Purpose Audio-Visual Corpus for Multi-Modal Persian Speech Recognition: the Arman-AV Dataset0
Analysis of Visual Features for Continuous Lipreading in Spanish0
Another Point of View on Visual Speech Recognition0
ASR is all you need: cross-modal distillation for lip reading0
Show:102550
← PrevPage 10 of 19Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1VTP with more dataWord Error Rate (WER)30.7Unverified
2CTC/AttentionWord Error Rate (WER)19.1Unverified
#ModelMetricClaimedVerifiedStatus
1VTP with more dataWord Error Rate (WER)22.6Unverified