SOTAVerified

Talking Face Generation

Talking face generation aims to synthesize a sequence of face images that corresponds to the given speech semantics.

(Image credit: Talking Face Generation by Adversarially Disentangled Audio-Visual Representation)

Papers

Showing 26–50 of 110 papers

| Title | Status | Hype |
| --- | --- | --- |
| Controllable Talking Face Generation by Implicit Facial Keypoints Editing | Code | 1 |
| Merkel Podcast Corpus: A Multimodal Dataset Compiled from 16 Years of Angela Merkel’s Weekly Video Podcasts | Code | 1 |
| Merkel Podcast Corpus: A Multimodal Dataset Compiled from 16 Years of Angela Merkel's Weekly Video Podcasts | Code | 1 |
| MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation | Code | 1 |
| KAN-Based Fusion of Dual-Domain for Audio-Driven Facial Landmarks Generation | Code | 1 |
| Joint Co-Speech Gesture and Expressive Talking Face Generation using Diffusion with Adapters | Code | 1 |
| FNeVR: Neural Volume Rendering for Face Animation | Code | 1 |
| DiffDub: Person-generic Visual Dubbing Using Inpainting Renderer with Diffusion Auto-encoder | Code | 1 |
| Flow-Guided One-Shot Talking Face Generation With a High-Resolution Audio-Visual Dataset | Code | 1 |
| FACIAL: Synthesizing Dynamic Talking Face with Implicit Attribute Learning | Code | 1 |
| Talking Face Generation by Adversarially Disentangled Audio-Visual Representation | Code | 0 |
| Talking Face Generation by Conditional Recurrent Adversarial Network | Code | 0 |
| Stochastic Talking Face Generation Using Latent Distribution Matching | Code | 0 |
| Neural Voice Puppetry: Audio-driven Facial Reenactment | Code | 0 |
| GSmoothFace: Generalized Smooth Talking Face Generation via Fine Grained 3D Face Guidance | Code | 0 |
| Capture, Learning, and Synthesis of 3D Speaking Styles | Code | 0 |
| ReenactGAN: Learning to Reenact Faces via Boundary Transfer | Code | 0 |
| AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head Synthesis | Code | 0 |
| Exploring Phonetic Context-Aware Lip-Sync For Talking Face Generation | | 0 |
| Emotion-Controllable Generalized Talking Face Generation | | 0 |
| DAE-Talker: High Fidelity Speech-Driven Talking Face Generation with Diffusion Autoencoder | | 0 |
| Emotional Talking Faces: Making Videos More Expressive and Realistic | | 0 |
| Cut Inner Layers: A Structured Pruning Strategy for Efficient U-Net GANs | | 0 |
| Audio-Visual Speech Representation Expert for Enhanced Talking Face Video Generation and Evaluation | | 0 |
| Emotional Conversation: Empowering Talking Faces with Cohesive Expression, Gaze and Pose Generation | | 0 |

Benchmark Results

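| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | EmoGen | EmoAcc | 83.2 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | LipGAN | LMD | 0.6 | | Unverified |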