SOTAVerified|Agents Browse Leaderboard About Blog

Lip to Speech Synthesis

Given a silent video of a speaker, generate the corresponding speech that matches the lip movements.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–10 of 13 papers

Title	Date	Tasks	Status	Hype	Score
Intelligible Lip-to-Speech Synthesis with Speech Units	May 31, 2023	Lip to Speech SynthesisSpeech Synthesis	CodeCode Available	1	5
Lip-to-Speech Synthesis in the Wild with Multi-task Learning	Feb 17, 2023	Lip to Speech SynthesisMulti-Task Learning	CodeCode Available	1	5
Show Me Your Face, And I'll Tell You How You Speak	Jun 28, 2022	Lip to Speech SynthesisSpeech Synthesis	CodeCode Available	1	5
Lip to Speech Synthesis with Visual Context Attentional GAN	Apr 4, 2022	Contrastive LearningGenerative Adversarial Network	CodeCode Available	1	5
Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis	May 17, 2020	Lip ReadingLip to Speech Synthesis	CodeCode Available	1	5
FastLTS: Non-Autoregressive End-to-End Unconstrained Lip-to-Speech Synthesis	Jul 8, 2022	Lip to Speech SynthesisSpeech Synthesis	CodeCode Available	0	5
Towards a practical lip-to-speech conversion system using deep neural networks and mobile application frontend	Apr 29, 2021	Lip to Speech SynthesisSpeech Synthesis	—Unverified	0	0
Zero-shot personalized lip-to-speech synthesis with face image based voice control	May 9, 2023	Lip to Speech SynthesisRepresentation Learning	—Unverified	0	0
Lip-to-Speech Synthesis for Arbitrary Speakers in the Wild	Sep 1, 2022	Lip to Speech SynthesisSpeech Synthesis	—Unverified	0	0
NaturalL2S: End-to-End High-quality Multispeaker Lip-to-Speech Synthesis with Differential Digital Signal Processing	Feb 17, 2025	Lip to Speech Synthesisspeech-recognition	—Unverified	0	0

Show:10 25 50

← PrevPage 1 of 2Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Lip2Wav	ESTOI	0.34	—	Unverified