SOTAVerified

Visual Speech Recognition

Papers

Showing 11-20 of 182 papers

Title | Status | Hype
End-to-end Audio-visual Speech Recognition with Conformers | Code | 1
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition | Code | 1
Deep Audio-Visual Speech Recognition | Code | 1
Do VSR Models Generalize Beyond LRS3? | Code | 1
Jointly Learning Visual and Auditory Speech Representations from Raw Data | Code | 1
Learn an Effective Lip Reading Model without Pains | Code | 1
CI-AVSR: A Cantonese Audio-Visual Speech Dataset for In-car Command Recognition | Code | 1
CI-AVSR: A Cantonese Audio-Visual Speech Dataset for In-car Command Recognition | Code | 1
How to Teach DNNs to Pay Attention to the Visual Modality in Speech Recognition | Code | 1
Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep Visual Speech Recognition | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | VTP with more data | Word Error Rate (WER) | 30.7 | | Unverified
2 | CTC/Attention | Word Error Rate (WER) | 19.1 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | VTP with more data | Word Error Rate (WER) | 22.6 | | Unverified