SOTAVerified

Visual Speech Recognition

Papers

Showing 91–100 of 182 papers

Title | Status | Hype
Comparison of Conventional Hybrid and CTC/Attention Decoders for Continuous Visual Speech Recognition | | 0
SynesLM: A Unified Approach for Audio-visual Speech Recognition and Translation via Language Model and Synthetic Data | | 0
Listening With Your Eyes: Towards a Practical Visual Speech Recognition System Using Deep Boltzmann Machines | | 0
LiteVSR: Efficient Visual Speech Recognition by Learning from Speech Representations of Unlabeled Data | | 0
Which phoneme-to-viseme maps best improve visual-only computer lip-reading? | | 0
Adaptive Audio-Visual Speech Recognition via Matryoshka-Based Multimodal LLMs | | 0
LRWR: Large-Scale Benchmark for Lip Reading in Russian language | | 0
Manifold-Kernels Comparison in MKPLS for Visual Speech Recognition | | 0
Combining Multiple Views for Visual Speech Recognition | | 0
Cocktail-Party Audio-Visual Speech Recognition | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | VTP with more data | Word Error Rate (WER) | 30.7 | | Unverified
2 | CTC/Attention | Word Error Rate (WER) | 19.1 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | VTP with more data | Word Error Rate (WER) | 22.6 | | Unverified