SOTAVerified

Visual Speech Recognition

Papers

Showing 8190 of 182 papers

TitleStatusHype
Continuous Speech Recognition using EEG and Video0
Leveraging Modality-specific Representations for Audio-visual Speech Recognition via Reinforcement Learning0
Leveraging Large Language Models in Visual Speech Recognition: Model Scaling, Context-Aware Decoding, and Iterative Polishing0
Lip2Vec: Efficient and Robust Visual Speech Recognition via Latent-to-Latent Visual to Audio Representation Mapping0
LipGen: Viseme-Guided Lip Video Generation for Enhancing Visual Speech Recognition0
Lip Graph Assisted Audio-Visual Speech Recognition Using Bidirectional Synchronous Fusion0
Conformers are All You Need for Visual Speech Recognition0
Lip Reading Sentences in the Wild0
Audio Visual Speech Recognition using Deep Recurrent Neural Networks0
Analysis of Visual Features for Continuous Lipreading in Spanish0
Show:102550
← PrevPage 9 of 19Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1VTP with more dataWord Error Rate (WER)30.7Unverified
2CTC/AttentionWord Error Rate (WER)19.1Unverified
#ModelMetricClaimedVerifiedStatus
1VTP with more dataWord Error Rate (WER)22.6Unverified