SOTAVerified

Visual Speech Recognition

Papers

Showing 2130 of 182 papers

TitleStatusHype
Adapter-Based Multi-Agent AVSR Extension for Pre-Trained ASR Models0
Evaluation of End-to-End Continuous Spanish Lipreading in Different Data ConditionsCode0
Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech RepresentationCode1
LipGen: Viseme-Guided Lip Video Generation for Enhancing Visual Speech Recognition0
Listening and Seeing Again: Generative Error Correction for Audio-Visual Speech RecognitionCode0
Uncovering the Visual Contribution in Audio-Visual Speech Recognition0
AlignVSR: Audio-Visual Cross-Modal Alignment for Visual Speech RecognitionCode1
Quantitative Analysis of Audio-Visual Tasks: An Information-Theoretic Perspective0
Large Language Models are Strong Audio-Visual Speech Recognition LearnersCode2
Enhancing CTC-Based Visual Speech Recognition0
Show:102550
← PrevPage 3 of 19Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1VTP with more dataWord Error Rate (WER)30.7Unverified
2CTC/AttentionWord Error Rate (WER)19.1Unverified
#ModelMetricClaimedVerifiedStatus
1VTP with more dataWord Error Rate (WER)22.6Unverified