SOTAVerified

Visual Speech Recognition

Papers

Showing 121130 of 182 papers

TitleStatusHype
Spatio-Temporal Attention Mechanism and Knowledge Distillation for Lip Reading0
Interactive decoding of words from visual speech recognition models0
Fusing information streams in end-to-end audio-visual speech recognition0
End-to-end Audio-visual Speech Recognition with ConformersCode1
Part-based Lipreading for Audio-Visual Speech Recognition0
AV Taris: Online Audio-Visual Speech RecognitionCode1
Lips Don't Lie: A Generalisable and Robust Approach to Face Forgery DetectionCode1
Learn an Effective Lip Reading Model without PainsCode1
Lip Graph Assisted Audio-Visual Speech Recognition Using Bidirectional Synchronous Fusion0
"Notic My Speech" -- Blending Speech Patterns With Multimedia0
Show:102550
← PrevPage 13 of 19Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1VTP with more dataWord Error Rate (WER)30.7Unverified
2CTC/AttentionWord Error Rate (WER)19.1Unverified
#ModelMetricClaimedVerifiedStatus
1VTP with more dataWord Error Rate (WER)22.6Unverified