SOTAVerified

Talking Face Generation

Talking face generation aims to synthesize a sequence of face images that correspond to given speech semantics

( Image credit: Talking Face Generation by Adversarially Disentangled Audio-Visual Representation )

Papers

Showing 4150 of 110 papers

TitleStatusHype
Audio-Visual Speech Representation Expert for Enhanced Talking Face Video Generation and Evaluation0
Emotional Conversation: Empowering Talking Faces with Cohesive Expression, Gaze and Pose Generation0
EmoTalker: Emotionally Editable Talking Face Generation via Diffusion Model0
CPNet: Exploiting CLIP-based Attention Condenser and Probability Map Guidance for High-fidelity Talking Face Generation0
EmoSpeaker: One-shot Fine-grained Emotion-Controlled Talking Face Generation0
Arbitrary Talking Face Generation via Attentional Audio-Visual Coherence Learning0
High-fidelity Generalized Emotional Talking Face Generation with Multi-modal Emotion Space Learning0
EMMN: Emotional Motion Memory Network for Audio-driven Emotional Talking Face Generation0
CP-EB: Talking Face Generation with Controllable Pose and Eye Blinking Embedding0
Audio-Driven Talking Face Generation with Diverse yet Realistic Facial Animations0
Show:102550
← PrevPage 5 of 11Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1EmoGenEmoAcc83.2Unverified
#ModelMetricClaimedVerifiedStatus
1LipGANLMD0.6Unverified