SOTAVerified

Visual Speech Recognition

Papers

Showing 7180 of 182 papers

TitleStatusHype
MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech RecognitionCode1
Automated Speaker Independent Visual Speech Recognition: A Comprehensive Survey0
OpenSR: Open-Modality Speech Recognition via Maintaining Multi-Modality AlignmentCode1
MAVD: The First Open Large-Scale Mandarin Audio-Visual Dataset with Depth InformationCode1
Improving the Gap in Visual Speech Recognition Between Normal and Silent Speech Based on Metric Learning0
Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot Task GeneralizationCode1
Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech RecognitionCode1
Multi-Temporal Lip-Audio Memory for Visual Speech Recognition0
Deep Learning-based Spatio Temporal Facial Feature Visual Speech Recognition0
SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision0
Show:102550
← PrevPage 8 of 19Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1VTP with more dataWord Error Rate (WER)30.7Unverified
2CTC/AttentionWord Error Rate (WER)19.1Unverified
#ModelMetricClaimedVerifiedStatus
1VTP with more dataWord Error Rate (WER)22.6Unverified