SOTAVerified

Visual Speech Recognition

Papers

Showing 101110 of 182 papers

TitleStatusHype
Kaggle Competition: Cantonese Audio-Visual Speech Recognition for In-car Commands0
Lip-Listening: Mixing Senses to Understand Lips using Cross Modality Knowledge Distillation for Word-Based Models0
CI-AVSR: A Cantonese Audio-Visual Speech Datasetfor In-car Command RecognitionCode1
RUSAVIC Corpus: Russian Audio-Visual Speech in Cars0
Is Lip Region-of-Interest Sufficient for Lipreading?0
Deep Learning for Visual Speech Analysis: A Survey0
Visual Speech Recognition for Multiple Languages in the WildCode2
Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech RecognitionCode1
Learning Contextually Fused Audio-visual Representations for Audio-visual Speech Recognition0
Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition for Single and Multi-Person Video0
Show:102550
← PrevPage 11 of 19Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1VTP with more dataWord Error Rate (WER)30.7Unverified
2CTC/AttentionWord Error Rate (WER)19.1Unverified
#ModelMetricClaimedVerifiedStatus
1VTP with more dataWord Error Rate (WER)22.6Unverified