SOTAVerified

Visual Speech Recognition

Papers

Showing 1-10 of 182 papers

| Title | Status | Hype |
|---|---|---|
| VisualSpeaker: Visually-Guided 3D Avatar Lip Synthesis | | 0 |
| ViCocktail: Automated Multi-Modal Data Collection for Vietnamese Audio-Visual Speech Recognition | | 0 |
| Cocktail-Party Audio-Visual Speech Recognition | | 0 |
| CNVSRC 2024: The Second Chinese Continuous Visual Speech Recognition Challenge | | 0 |
| Leveraging Large Language Models in Visual Speech Recognition: Model Scaling, Context-Aware Decoding, and Iterative Polishing | | 0 |
| Transfer Learning from Visual Speech Recognition to Mouthing Recognition in German Sign Language | Code | 0 |
| Scaling and Enhancing LLM-based AVSR: A Sparse Mixture of Projectors Approach | | 0 |
| The Multimodal Information Based Speech Processing (MISP) 2025 Challenge: Audio-Visual Diarization and Recognition | | 0 |
| SwinLip: An Efficient Visual Speech Encoder for Lip Reading Using Swin Transformer | | 0 |
| CoGenAV: Versatile Audio-Visual Representation Learning via Contrastive-Generative Synchronization | Code | 2 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | VTP with more data | Word Error Rate (WER) | 22.6 | | Unverified |