SOTAVerified

Visual Speech Recognition

Papers

Showing 101110 of 182 papers

TitleStatusHype
CNVSRC 2024: The Second Chinese Continuous Visual Speech Recognition Challenge0
MKPLS: Manifold Kernel Partial Least Squares for Lipreading and Speaker Identification0
MLCA-AVSR: Multi-Layer Cross Attention Fusion based Audio-Visual Speech Recognition0
CNVSRC 2023: The First Chinese Continuous Visual Speech Recognition Challenge0
MobiVSR: A Visual Speech Recognition Solution for Mobile Devices0
Modality Attention for End-to-End Audio-visual Speech Recognition0
MoHAVE: Mixture of Hierarchical Audio-Visual Experts for Robust Speech Recognition0
MSRS: Training Multimodal Speech Recognition Models from Scratch with Sparse Mask Optimization0
Chinese-LiPS: A Chinese audio-visual speech recognition dataset with Lip-reading and Presentation Slides0
XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception0
Show:102550
← PrevPage 11 of 19Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1VTP with more dataWord Error Rate (WER)30.7Unverified
2CTC/AttentionWord Error Rate (WER)19.1Unverified
#ModelMetricClaimedVerifiedStatus
1VTP with more dataWord Error Rate (WER)22.6Unverified