SOTAVerified

Visual Speech Recognition

Papers

Showing 101–125 of 182 papers

| Title | Status | Hype |
| --- | --- | --- |
| Kaggle Competition: Cantonese Audio-Visual Speech Recognition for In-car Commands | | 0 |
| Lip-Listening: Mixing Senses to Understand Lips using Cross Modality Knowledge Distillation for Word-Based Models | | 0 |
| CI-AVSR: A Cantonese Audio-Visual Speech Dataset for In-car Command Recognition | Code | 1 |
| RUSAVIC Corpus: Russian Audio-Visual Speech in Cars | | 0 |
| Is Lip Region-of-Interest Sufficient for Lipreading? | | 0 |
| Deep Learning for Visual Speech Analysis: A Survey | | 0 |
| Visual Speech Recognition for Multiple Languages in the Wild | Code | 2 |
| Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition | Code | 1 |
| Learning Contextually Fused Audio-visual Representations for Audio-visual Speech Recognition | | 0 |
| Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition for Single and Multi-Person Video | | 0 |
| Recent Progress in the CUHK Dysarthric Speech Recognition System | | 0 |
| CI-AVSR: A Cantonese Audio-Visual Speech Dataset for In-car Command Recognition | Code | 1 |
| Robust Self-Supervised Audio-Visual Speech Recognition | Code | 2 |
| Leveraging Uni-Modal Self-Supervised Learning for Multimodal Audio-visual Speech Recognition | | 0 |
| Advances and Challenges in Deep Lip Reading | | 0 |
| Sub-word Level Lip Reading With Visual Attention | | 0 |
| Perception Point: Identifying Critical Learning Periods in Speech for Bilingual Networks | | 0 |
| Audio-Visual Speech Recognition is Worth 32×32×8 Voxels | | 0 |
| LRWR: Large-Scale Benchmark for Lip Reading in Russian language | | 0 |
| Large-vocabulary Audio-visual Speech Recognition in Noisy Environments | | 0 |
| Spatio-Temporal Attention Mechanism and Knowledge Distillation for Lip Reading | | 0 |
| Interactive decoding of words from visual speech recognition models | | 0 |
| Fusing information streams in end-to-end audio-visual speech recognition | | 0 |
| End-to-end Audio-visual Speech Recognition with Conformers | Code | 1 |
| Part-based Lipreading for Audio-Visual Speech Recognition | | 0 |

Benchmark Results

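| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | VTP with more data | Word Error Rate (WER) | 30.7 | | Unverified |
| 2 | CTC/Attention | Word Error Rate (WER) | 19.1 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | VTP with more data | Word Error Rate (WER) | 22.6 | | Unverified |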