SOTAVerified

Talking Face Generation

Talking face generation aims to synthesize a sequence of face images that correspond to given speech semantics

( Image credit: Talking Face Generation by Adversarially Disentangled Audio-Visual Representation )

Papers

Showing 2130 of 110 papers

TitleStatusHype
Neural Text to Articulate Talk: Deep Text to Audiovisual Speech Synthesis achieving both Auditory and Photo-realismCode1
MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face GenerationCode1
KAN-Based Fusion of Dual-Domain for Audio-Driven Facial Landmarks GenerationCode1
Merkel Podcast Corpus: A Multimodal Dataset Compiled from 16 Years of Angela Merkel's Weekly Video PodcastsCode1
Controllable Talking Face Generation by Implicit Facial Keypoints EditingCode1
HDTR-Net: A Real-Time High-Definition Teeth Restoration Network for Arbitrary Talking Face Generation MethodsCode1
Parallel and High-Fidelity Text-to-Lip GenerationCode1
Joint Co-Speech Gesture and Expressive Talking Face Generation using Diffusion with AdaptersCode1
Flow-Guided One-Shot Talking Face Generation With a High-Resolution Audio-Visual DatasetCode1
FNeVR: Neural Volume Rendering for Face AnimationCode1
Show:102550
← PrevPage 3 of 11Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1EmoGenEmoAcc83.2Unverified
#ModelMetricClaimedVerifiedStatus
1LipGANLMD0.6Unverified