SOTAVerified

Visual Speech Recognition

Papers

Showing 131140 of 182 papers

TitleStatusHype
Adapter-Based Multi-Agent AVSR Extension for Pre-Trained ASR Models0
Resolution limits on visual speech recognition0
ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement0
ReVISE: Self-Supervised Speech Resynthesis With Visual Input for Universal and Generalized Speech Regeneration0
Audio Visual Speech Recognition using Deep Recurrent Neural Networks0
RUSAVIC Corpus: Russian Audio-Visual Speech in Cars0
Scaling and Enhancing LLM-based AVSR: A Sparse Mixture of Projectors Approach0
Audio-Visual Speech Recognition is Worth 32328 Voxels0
SlideAVSR: A Dataset of Paper Explanation Videos for Audio-Visual Speech Recognition0
SparseVSR: Lightweight and Noise Robust Visual Speech Recognition0
Show:102550
← PrevPage 14 of 19Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1VTP with more dataWord Error Rate (WER)30.7Unverified
2CTC/AttentionWord Error Rate (WER)19.1Unverified
#ModelMetricClaimedVerifiedStatus
1VTP with more dataWord Error Rate (WER)22.6Unverified