SOTAVerified

Visual Speech Recognition

Papers

Showing 151182 of 182 papers

TitleStatusHype
Large-Scale Visual Speech Recognition0
Deep Lip Reading: a comparison of models and an online application0
Towards Lipreading Sentences with Active Appearance Models0
Task-dependent modulation of the visual sensory thalamus assists visual-speech recognition0
Visual-Only Recognition of Normal, Whispered and Silent Speech0
Deep word embeddings for visual speech recognitionCode0
Combining Multiple Views for Visual Speech Recognition0
Visual Speech Recognition Using PCA Networks and LSTMs in a Tandem GMM-HMM System0
Which phoneme-to-viseme maps best improve visual-only computer lip-reading?0
Resolution limits on visual speech recognition0
Visual speech recognition: aligning terminologies for better understanding0
Multimodal Machine Learning: Integrating Language, Vision and Speech0
Towards Estimating the Upper Bound of Visual-Speech Recognition: The Visual Lip-Reading Feasibility Database0
Deep Multimodal Representation Learning from Temporal Data0
Combining Residual Networks with LSTMs for LipreadingCode0
End-To-End Visual Speech Recognition With LSTMs0
Auxiliary Multimodal LSTM for Audio-visual Speech Recognition and Lipreading0
Lip Reading Sentences in the Wild0
Audio Visual Speech Recognition using Deep Recurrent Neural Networks0
A three-dimensional approach to Visual Speech Recognition using Discrete Cosine Transforms0
Manifold-Kernels Comparison in MKPLS for Visual Speech Recognition0
Listening With Your Eyes: Towards a Practical Visual Speech Recognition System Using Deep Boltzmann Machines0
Video-Based Action Recognition Using Rate-Invariant Analysis of Covariance Trajectories0
Deep Multimodal Learning for Audio-Visual Speech Recognition0
Visual Words for Automatic Lip-Reading0
Visual Speech Recognition0
Recognition of Isolated Words using Zernike and MFCC features for Audio Visual Speech Recognition0
Preliminary Test of a Real-Time, Interactive Silent Speech Interface Based on Electromagnetic Articulograph0
Rate-Invariant Analysis of Trajectories on Riemannian Manifolds with Application in Visual Speech Recognition0
MKPLS: Manifold Kernel Partial Least Squares for Lipreading and Speaker Identification0
SUTAV: A Turkish Audio-Visual Database0
Building a synchronous corpus of acoustic and 3D facial marker data for adaptive audio-visual speech synthesis0
Show:102550
← PrevPage 4 of 4Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1VTP with more dataWord Error Rate (WER)30.7Unverified
2CTC/AttentionWord Error Rate (WER)19.1Unverified
#ModelMetricClaimedVerifiedStatus
1VTP with more dataWord Error Rate (WER)22.6Unverified