SOTAVerified

Lip to Speech Synthesis

Given a silent video of a speaker, generate the corresponding speech that matches the lip movements.

Papers

Showing 11–13 of 13 papers

Title | Status | Hype
On the Audio-visual Synchronization for Lip-to-Speech Synthesis |  | 0
RobustL2S: Speaker-Specific Lip-to-Speech Synthesis exploiting Self-Supervised Representations |  | 0
FastLTS: Non-Autoregressive End-to-End Unconstrained Lip-to-Speech Synthesis | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Lip2Wav | ESTOI | 0.34 |  | Unverified