SOTAVerified

Visual Speech Recognition

Papers

Showing 131-140 of 182 papers

| Title | Status | Hype |
| --- | --- | --- |
| Enhancing CTC-Based Visual Speech Recognition | | 0 |
| Fusing information streams in end-to-end audio-visual speech recognition | | 0 |
| Improving the Gap in Visual Speech Recognition Between Normal and Silent Speech Based on Metric Learning | | 0 |
| Interactive decoding of words from visual speech recognition models | | 0 |
| Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition | | 0 |
| Is Lip Region-of-Interest Sufficient for Lipreading? | | 0 |
| SynesLM: A Unified Approach for Audio-visual Speech Recognition and Translation via Language Model and Synthetic Data | | 0 |
| Uncovering the Visual Contribution in Audio-Visual Speech Recognition | | 0 |
| VATLM: Visual-Audio-Text Pre-Training with Unified Masked Prediction for Speech Representation Learning | | 0 |
| ViCocktail: Automated Multi-Modal Data Collection for Vietnamese Audio-Visual Speech Recognition | | 0 |
Page 14 of 19

Benchmark Results

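| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | VTP with more data | Word Error Rate (WER) | 30.7 | | Unverified |
| 2 | CTC/Attention | Word Error Rate (WER) | 19.1 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | VTP with more data | Word Error Rate (WER) | 22.6 | | Unverified |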