SOTAVerified

Visual Speech Recognition

Papers

Showing 101150 of 182 papers

TitleStatusHype
Kaggle Competition: Cantonese Audio-Visual Speech Recognition for In-car Commands0
Lip-Listening: Mixing Senses to Understand Lips using Cross Modality Knowledge Distillation for Word-Based Models0
CI-AVSR: A Cantonese Audio-Visual Speech Datasetfor In-car Command RecognitionCode1
RUSAVIC Corpus: Russian Audio-Visual Speech in Cars0
Is Lip Region-of-Interest Sufficient for Lipreading?0
Deep Learning for Visual Speech Analysis: A Survey0
Visual Speech Recognition for Multiple Languages in the WildCode2
Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech RecognitionCode1
Learning Contextually Fused Audio-visual Representations for Audio-visual Speech Recognition0
Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition for Single and Multi-Person Video0
Recent Progress in the CUHK Dysarthric Speech Recognition System0
CI-AVSR: A Cantonese Audio-Visual Speech Dataset for In-car Command RecognitionCode1
Robust Self-Supervised Audio-Visual Speech RecognitionCode2
Leveraging Uni-Modal Self-Supervised Learning for Multimodal Audio-visual Speech Recognition0
Advances and Challenges in Deep Lip Reading0
Sub-word Level Lip Reading With Visual Attention0
Perception Point: Identifying Critical Learning Periods in Speech for Bilingual Networks0
Audio-Visual Speech Recognition is Worth 32328 Voxels0
LRWR: Large-Scale Benchmark for Lip Reading in Russian language0
Large-vocabulary Audio-visual Speech Recognition in Noisy Environments0
Spatio-Temporal Attention Mechanism and Knowledge Distillation for Lip Reading0
Interactive decoding of words from visual speech recognition models0
Fusing information streams in end-to-end audio-visual speech recognition0
End-to-end Audio-visual Speech Recognition with ConformersCode1
Part-based Lipreading for Audio-Visual Speech Recognition0
AV Taris: Online Audio-Visual Speech RecognitionCode1
Lips Don't Lie: A Generalisable and Robust Approach to Face Forgery DetectionCode1
Learn an Effective Lip Reading Model without PainsCode1
Lip Graph Assisted Audio-Visual Speech Recognition Using Bidirectional Synchronous Fusion0
"Notic My Speech" -- Blending Speech Patterns With Multimedia0
Should we hard-code the recurrence concept or learn it instead ? Exploring the Transformer architecture for Audio-Visual Speech RecognitionCode1
How to Teach DNNs to Pay Attention to the Visual Modality in Speech RecognitionCode1
Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep Visual Speech RecognitionCode1
Audio-visual Recognition of Overlapped speech for the LRS2 dataset0
Detecting Adversarial Attacks On Audiovisual Speech Recognition0
Continuous Speech Recognition using EEG and Video0
ASR is all you need: cross-modal distillation for lip reading0
Recurrent Neural Network Transducer for Audio-Visual Speech RecognitionCode0
Investigating the Lombard Effect Influence on End-to-End Audio-Visual Speech Recognition0
MobiVSR: A Visual Speech Recognition Solution for Mobile Devices0
End-to-End Visual Speech Recognition for Small-Scale Datasets0
Harnessing GANs for Zero-shot Learning of New Classes in Visual Speech RecognitionCode0
Modality Attention for End-to-End Audio-visual Speech Recognition0
LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the WildCode0
3D Feature Pyramid Attention Module for Robust Visual Speech Recognition0
Audio-Visual Speech Recognition With A Hybrid CTC/Attention Architecture0
Perfect match: Improved cross-modal embeddings for audio-visual synchronisation0
Deep Audio-Visual Speech RecognitionCode1
LRS3-TED: a large-scale dataset for visual speech recognitionCode0
Zero-shot keyword spotting for visual speech recognition in-the-wildCode1
Show:102550
← PrevPage 3 of 4Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1VTP with more dataWord Error Rate (WER)30.7Unverified
2CTC/AttentionWord Error Rate (WER)19.1Unverified
#ModelMetricClaimedVerifiedStatus
1VTP with more dataWord Error Rate (WER)22.6Unverified