SOTAVerified

Visual Speech Recognition

Papers

Showing 151–160 of 182 papers

| Title | Status | Hype |
|---|---|---|
| XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception | | 0 |
| SlideAVSR: A Dataset of Paper Explanation Videos for Audio-Visual Speech Recognition | | 0 |
| SparseVSR: Lightweight and Noise Robust Visual Speech Recognition | | 0 |
| Spatio-Temporal Attention Mechanism and Knowledge Distillation for Lip Reading | | 0 |
| Speaker-Adapted End-to-End Visual Speech Recognition for Continuous Spanish | | 0 |
| Streaming Audio-Visual Speech Recognition with Alignment Regularization | | 0 |
| Sub-word Level Lip Reading With Visual Attention | | 0 |
| SUTAV: A Turkish Audio-Visual Database | | 0 |
| SwinLip: An Efficient Visual Speech Encoder for Lip Reading Using Swin Transformer | | 0 |
| SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision | | 0 |

Benchmark Results

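| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | VTP with more data | Word Error Rate (WER) | 30.7 | | Unverified |
| 2 | CTC/Attention | Word Error Rate (WER) | 19.1 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | VTP with more data | Word Error Rate (WER) | 22.6 | | Unverified |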