SOTAVerified

Visual Speech Recognition

Papers

Showing 8190 of 182 papers

TitleStatusHype
Auto-AVSR: Audio-Visual Speech Recognition with Automatic LabelsCode2
Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability ScoringCode1
The NPU-ASLP System for Audio-Visual Speech Recognition in MISP 2022 Challenge0
MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and RecognitionCode1
MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text TranslationCode2
Deep Visual Forced Alignment: Learning to Align Transcription with Talking Face Video0
Conformers are All You Need for Visual Speech Recognition0
Audio-Visual Speech and Gesture Recognition by Sensors of Mobile Devices0
Prompt Tuning of Deep Neural Networks for Speaker-adaptive Visual Speech Recognition0
AV-data2vec: Self-supervised Learning of Audio-Visual Speech Representations with Contextualized Target Representations0
Show:102550
← PrevPage 9 of 19Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1VTP with more dataWord Error Rate (WER)30.7Unverified
2CTC/AttentionWord Error Rate (WER)19.1Unverified
#ModelMetricClaimedVerifiedStatus
1VTP with more dataWord Error Rate (WER)22.6Unverified