SOTAVerified

Visual Speech Recognition

Papers

Showing 126150 of 182 papers

TitleStatusHype
Prompt Tuning of Deep Neural Networks for Speaker-adaptive Visual Speech Recognition0
Quantitative Analysis of Audio-Visual Tasks: An Information-Theoretic Perspective0
Rate-Invariant Analysis of Trajectories on Riemannian Manifolds with Application in Visual Speech Recognition0
Recent Progress in the CUHK Dysarthric Speech Recognition System0
Recognition of Isolated Words using Zernike and MFCC features for Audio Visual Speech Recognition0
Adapter-Based Multi-Agent AVSR Extension for Pre-Trained ASR Models0
Resolution limits on visual speech recognition0
ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement0
ReVISE: Self-Supervised Speech Resynthesis With Visual Input for Universal and Generalized Speech Regeneration0
Audio Visual Speech Recognition using Deep Recurrent Neural Networks0
RUSAVIC Corpus: Russian Audio-Visual Speech in Cars0
Scaling and Enhancing LLM-based AVSR: A Sparse Mixture of Projectors Approach0
Audio-Visual Speech Recognition is Worth 32328 Voxels0
SlideAVSR: A Dataset of Paper Explanation Videos for Audio-Visual Speech Recognition0
SparseVSR: Lightweight and Noise Robust Visual Speech Recognition0
Spatio-Temporal Attention Mechanism and Knowledge Distillation for Lip Reading0
Speaker-Adapted End-to-End Visual Speech Recognition for Continuous Spanish0
Streaming Audio-Visual Speech Recognition with Alignment Regularization0
Sub-word Level Lip Reading With Visual Attention0
SUTAV: A Turkish Audio-Visual Database0
SwinLip: An Efficient Visual Speech Encoder for Lip Reading Using Swin Transformer0
Audio-Visual Speech and Gesture Recognition by Sensors of Mobile Devices0
SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision0
Audio-visual Recognition of Overlapped speech for the LRS2 dataset0
Task-dependent modulation of the visual sensory thalamus assists visual-speech recognition0
Show:102550
← PrevPage 6 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1VTP with more dataWord Error Rate (WER)30.7Unverified
2CTC/AttentionWord Error Rate (WER)19.1Unverified
#ModelMetricClaimedVerifiedStatus
1VTP with more dataWord Error Rate (WER)22.6Unverified