Talking Head Generation

Talking head generation is the task of generating a talking face from a set of images of a person.

(Image credit: Few-Shot Adversarial Learning of Realistic Neural Talking Head Models)

Papers


Showing 26–50 of 119 papers

Title	Date	Tasks	Status	Hype	Score
AnimeCeleb: Large-Scale Animation CelebHeads Dataset for Head Reenactment	Nov 15, 2021	Colorization, Face Reenactment	Code Available	1	5
Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion	Jul 20, 2021	Image Generation, Talking Head Generation	Code Available	1	5
Talking-head Generation with Rhythmic Head Motion	Jul 16, 2020	Talking Head Generation	Code Available	1	5
DiffDub: Person-generic Visual Dubbing Using Inpainting Renderer with Diffusion Auto-encoder	Nov 3, 2023	Talking Face Generation, Talking Head Generation	Code Available	1	5
Face Animation with an Attribute-Guided Diffusion Model	Apr 6, 2023	3D Face Reconstruction, Attribute	Code Available	1	5
Fast Bi-layer Neural Synthesis of One-Shot Realistic Head Avatars	Aug 24, 2020	Neural Rendering, Talking Head Generation	Code Available	1	5
Silence is Golden: Leveraging Adversarial Examples to Nullify Audio Control in LDM-based Talking-Head Generation	Jun 2, 2025	Misinformation, Talking Head Generation	Code Available	1	5
Perceptual Conversational Head Generation with Regularized Driver and Enhanced Renderer	Jun 26, 2022	Talking Head Generation	Code Available	1	5
Write-a-speaker: Text-based Emotional and Rhythmic Talking-head Generation	Apr 16, 2021	Face Model, Rhythm	Code Available	1	5
What comprises a good talking-head video generation?: A Survey and Benchmark	May 7, 2020	Talking Head Generation, Video Generation	Code Available	1	5
MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation	Aug 1, 2020	Face Generation, Talking Face Generation	Code Available	1	5
Comparative Analysis of Audio Feature Extraction for Real-Time Talking Portrait Synthesis	Nov 20, 2024	Talking Head Generation	Code Available	1	5
Autoregressive GAN for Semantic Unconditional Head Motion Generation	Nov 2, 2022	Motion Generation, Talking Head Generation	Code Available	1	5
Learning Landmarks Motion from Speech for Speaker-Agnostic 3D Talking Heads Generation	Jun 2, 2023	3D Face Animation, Talking Head Generation	Code Available	1	5
AI-generated characters for supporting personalized learning and well-being	Dec 15, 2021	Face Reenactment, Neural Rendering	Code Available	1	5
Joint Co-Speech Gesture and Expressive Talking Face Generation using Diffusion with Adapters	Dec 18, 2024	Face Generation, Talking Face Generation	Code Available	1	5
DisCoHead: Audio-and-Video-Driven Talking Head Generation by Disentangled Control of Head Pose and Facial Expressions	Mar 14, 2023	Talking Head Generation	Code Available	1	5
GoHD: Gaze-oriented and Highly Disentangled Portrait Animation with Rhythmic Poses and Realistic Expression	Dec 12, 2024	Disentanglement, Portrait Animation	Code Available	1	5
A Comparative Study of Perceptual Quality Metrics for Audio-driven Talking Head Videos	Mar 11, 2024	Talking Head Generation	Code Available	1	5
EmoVOCA: Speech-Driven Emotional 3D Talking Heads	Mar 19, 2024	Talking Head Generation	Code Available	1	5
Text-to-Video: a Two-stage Framework for Zero-shot Identity-agnostic Talking-head Generation	Aug 12, 2023	Talking Head Generation, text-to-speech	Code Available	0	5
Neural Voice Puppetry: Audio-driven Facial Reenactment	Dec 11, 2019	Face Model, Neural Rendering	Code Available	0	5
A Comprehensive Multi-scale Approach for Speech and Dynamics Synchrony in Talking Head Generation	Jul 4, 2023	Talking Head Generation	Code Available	0	5
ReenactGAN: Learning to Reenact Faces via Boundary Transfer	Jul 29, 2018	Decoder, Face Reenactment	Code Available	0	5
EMOdiffhead: Continuously Emotional Control in Talking Head Generation via Diffusion	Sep 11, 2024	Portrait Animation, Talking Head Generation	Code Available	0	5



Datasets: VoxCeleb2 - 1-shot learning; VoxCeleb1 - 1-shot learning; VoxCeleb1 - 32-shot learning; VoxCeleb1 - 8-shot learning; VoxCeleb2 - 8-shot learning; 100 sleep nights of 8 caregivers; VoxCeleb2 - 32-shot learning

Benchmark Results

VoxCeleb2 - 1-shot learning

#	Model	Metric	Claimed	Verified	Status
1	Few-shot Adversarial Model	FID	48.5	—	Unverified
2	CainGAN	FID	35	—	Unverified
3	Fast Bi-layer Avatars (medium size)	CSIM	0.65	—	Unverified
4	First Order Motion Model (medium size)	CSIM	0.64	—	Unverified
5	Few-shot Vid-to-vid (medium size)	CSIM	0.6	—	Unverified

VoxCeleb1 - 1-shot learning

#	Model	Metric	Claimed	Verified	Status
1	X2Face	FID	45.8	—	Unverified
2	Few-shot Adversarial Model	FID	43	—	Unverified

VoxCeleb1 - 32-shot learning

#	Model	Metric	Claimed	Verified	Status
1	X2Face	FID	56.5	—	Unverified
2	Few-shot Adversarial Model	FID	29.5	—	Unverified

VoxCeleb1 - 8-shot learning

#	Model	Metric	Claimed	Verified	Status
1	X2Face	FID	51.5	—	Unverified
2	Few-shot Adversarial Model	FID	38	—	Unverified

VoxCeleb2 - 8-shot learning

#	Model	Metric	Claimed	Verified	Status
1	Few-shot Adversarial Model	FID	42.2	—	Unverified
2	CainGAN	FID	24.9	—	Unverified

100 sleep nights of 8 caregivers

#	Model	Metric	Claimed	Verified	Status
1	Ashok	10%	12	—	Unverified

VoxCeleb2 - 32-shot learning

#	Model	Metric	Claimed	Verified	Status
1	Few-shot Adversarial Model	FID	30.6	—	Unverified