SOTAVerified

Lip Reading

Lip reading is the task of inferring the speech content of a video from visual information alone, chiefly lip movements. Its practical applications include assisting audio-based speech recognition, biometric authentication, and aiding hearing-impaired people.

Source: Mutual Information Maximization for Effective Lip Reading

Papers

Showing 21–30 of 153 papers

Title | Status | Hype
Can We Read Speech Beyond the Lips? Rethinking RoI Selection for Deep Visual Speech Recognition | Code | 1
Deep Audio-Visual Speech Recognition | Code | 1
Distinguishing Homophenes Using Multi-Head Visual-Audio Memory for Lip Reading | Code | 1
Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video | Code | 1
Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech Recognition | Code | 1
Personalized Lip Reading: Adapting to Your Unique Lip Movements with Vision and Language | Code | 1
Contrastive Learning of Global-Local Video Representations | Code | 1
Lipreading using Temporal Convolutional Networks | Code | 1
LipVoicer: Generating Speech from Silent Videos Guided by Lip Reading | Code | 1
OLKAVS: An Open Large-Scale Korean Audio-Visual Speech Dataset | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Lip2Wav | WER | 14.08 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Lip2Wav | WER | 34.2 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Lip2Wav | WER | 31.26 | | Unverified