SOTAVerified

Talking Face Generation

Talking face generation aims to synthesize a sequence of face images that correspond to given speech semantics

( Image credit: Talking Face Generation by Adversarially Disentangled Audio-Visual Representation )

Papers

Showing 7180 of 110 papers

TitleStatusHype
Audio-Driven Talking Face Generation with Diverse yet Realistic Facial Animations0
Audio-Visual Speech Representation Expert for Enhanced Talking Face Video Generation and Evaluation0
A Unified Compression Framework for Efficient Speech-Driven Talking-Face Generation0
AVI-Talking: Learning Audio-Visual Instructions for Expressive 3D Talking Face Generation0
CP-EB: Talking Face Generation with Controllable Pose and Eye Blinking Embedding0
CPNet: Exploiting CLIP-based Attention Condenser and Probability Map Guidance for High-fidelity Talking Face Generation0
Cut Inner Layers: A Structured Pruning Strategy for Efficient U-Net GANs0
DAE-Talker: High Fidelity Speech-Driven Talking Face Generation with Diffusion Autoencoder0
Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation0
DisentTalk: Cross-lingual Talking Face Generation via Semantic Disentangled Diffusion Model0
Show:102550
← PrevPage 8 of 11Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1EmoGenEmoAcc83.2Unverified
#ModelMetricClaimedVerifiedStatus
1LipGANLMD0.6Unverified