SOTAVerified

Visual Speech Recognition

Papers

Showing 161170 of 182 papers

TitleStatusHype
VATLM: Visual-Audio-Text Pre-Training with Unified Masked Prediction for Speech Representation Learning0
ViCocktail: Automated Multi-Modal Data Collection for Vietnamese Audio-Visual Speech Recognition0
Video-Based Action Recognition Using Rate-Invariant Analysis of Covariance Trajectories0
Visual-Aware Speech Recognition for Noisy Scenarios0
ASR is all you need: cross-modal distillation for lip reading0
Visual-Only Recognition of Normal, Whispered and Silent Speech0
VisualSpeaker: Visually-Guided 3D Avatar Lip Synthesis0
Visual Speech Recognition0
Visual speech recognition: aligning terminologies for better understanding0
Another Point of View on Visual Speech Recognition0
Show:102550
← PrevPage 17 of 19Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1VTP with more dataWord Error Rate (WER)30.7Unverified
2CTC/AttentionWord Error Rate (WER)19.1Unverified
#ModelMetricClaimedVerifiedStatus
1VTP with more dataWord Error Rate (WER)22.6Unverified