SOTAVerified

Lip to Speech Synthesis

Given a silent video of a speaker, generate the corresponding speech that matches the lip movements.

Papers

Showing 110 of 13 papers

TitleStatusHype
Intelligible Lip-to-Speech Synthesis with Speech UnitsCode1
Lip-to-Speech Synthesis in the Wild with Multi-task LearningCode1
Show Me Your Face, And I'll Tell You How You SpeakCode1
Lip to Speech Synthesis with Visual Context Attentional GANCode1
Learning Individual Speaking Styles for Accurate Lip to Speech SynthesisCode1
FastLTS: Non-Autoregressive End-to-End Unconstrained Lip-to-Speech SynthesisCode0
Towards a practical lip-to-speech conversion system using deep neural networks and mobile application frontend0
Zero-shot personalized lip-to-speech synthesis with face image based voice control0
Lip-to-Speech Synthesis for Arbitrary Speakers in the Wild0
NaturalL2S: End-to-End High-quality Multispeaker Lip-to-Speech Synthesis with Differential Digital Signal Processing0
Show:102550
← PrevPage 1 of 2Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Lip2WavESTOI0.34Unverified