Lip to Speech Synthesis
Given a silent video of a speaker, generate the corresponding speech that matches the lip movements.
Papers
Showing 1–10 of 13 papers
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Lip2Wav | ESTOI | 0.34 | — | Unverified |
Given a silent video of a speaker, generate the corresponding speech that matches the lip movements.
Showing 1–10 of 13 papers
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Lip2Wav | ESTOI | 0.34 | — | Unverified |