SOTAVerified

Visual Speech Recognition

Papers

Showing 7180 of 182 papers

TitleStatusHype
AKVSR: Audio Knowledge Empowered Visual Speech Recognition by Compressing Audio Knowledge of a Pretrained Model0
End-to-End Lip Reading in Romanian with Cross-Lingual Domain Adaptation and Lateral Inhibition0
Detecting Adversarial Attacks On Audiovisual Speech Recognition0
AV-data2vec: Self-supervised Learning of Audio-Visual Speech Representations with Contextualized Target Representations0
Deep Visual Forced Alignment: Learning to Align Transcription with Talking Face Video0
AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition0
Deep Multimodal Representation Learning from Temporal Data0
Deep Multimodal Learning for Audio-Visual Speech Recognition0
Auxiliary Multimodal LSTM for Audio-visual Speech Recognition and Lipreading0
ASR is all you need: cross-modal distillation for lip reading0
Show:102550
← PrevPage 8 of 19Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1VTP with more dataWord Error Rate (WER)30.7Unverified
2CTC/AttentionWord Error Rate (WER)19.1Unverified
#ModelMetricClaimedVerifiedStatus
1VTP with more dataWord Error Rate (WER)22.6Unverified