SOTAVerified

Visual Speech Recognition

Papers

Showing 151-160 of 182 papers

Title | Status | Hype
The GUA-Speech System Description for CNVSRC Challenge 2023 | | 0
The Multimodal Information Based Speech Processing (MISP) 2023 Challenge: Audio-Visual Target Speaker Extraction | | 0
The Multimodal Information Based Speech Processing (MISP) 2025 Challenge: Audio-Visual Diarization and Recognition | | 0
A three-dimensional approach to Visual Speech Recognition using Discrete Cosine Transforms | | 0
The NPU-ASLP System for Audio-Visual Speech Recognition in MISP 2022 Challenge | | 0
Towards Estimating the Upper Bound of Visual-Speech Recognition: The Visual Lip-Reading Feasibility Database | | 0
Towards Lipreading Sentences with Active Appearance Models | | 0
3D Feature Pyramid Attention Module for Robust Visual Speech Recognition | | 0
Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition for Single and Multi-Person Video | | 0
Uncovering the Visual Contribution in Audio-Visual Speech Recognition | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | VTP with more data | Word Error Rate (WER) | 30.7 | | Unverified
2 | CTC/Attention | Word Error Rate (WER) | 19.1 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | VTP with more data | Word Error Rate (WER) | 22.6 | | Unverified