Talking Head Generation

Talking head generation is the task of generating a talking face from a set of images of a person.

( Image credit: Few-Shot Adversarial Learning of Realistic Neural Talking Head Models )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–25 of 119 papers

Title	Date	Tasks	Status	Hype
SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation	Nov 22, 2022	Image AnimationTalking Head Generation	CodeCode Available	6
Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation	Apr 3, 2025	MambaTalking Head Generation	CodeCode Available	3
DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation	Oct 17, 2024	Talking Head GenerationVideo Generation	CodeCode Available	3
DreamTalk: When Emotional Talking Head Generation Meets Diffusion Probabilistic Models	Dec 15, 2023	DenoisingTalking Head Generation	CodeCode Available	3
SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis	Nov 29, 2023	NeRFTalking Face Generation	CodeCode Available	3
A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild	Aug 23, 2020	AllMORPH	CodeCode Available	3
InsTaG: Learning Personalized 3D Talking Head from Few-Second Video	Feb 27, 2025	3DGSTalking Head Generation	CodeCode Available	2
Beyond Fixed Topologies: Unregistered Training and Comprehensive Evaluation Metrics for 3D Talking Heads	Oct 14, 2024	Talking Head Generation	CodeCode Available	2
MoDiTalker: Motion-Disentangled Diffusion Model for High-Fidelity Talking Head Generation	Mar 28, 2024	Talking Head Generation	CodeCode Available	2
Adaptive Super Resolution For One-Shot Talking-Head Generation	Mar 23, 2024	DecoderSuper-Resolution	CodeCode Available	2
Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation	Sep 10, 2023	Talking Head Generation	CodeCode Available	2
Implicit Identity Representation Conditioned Memory Compensation Network for Talking Head video Generation	Jul 19, 2023	Talking Head GenerationVideo Generation	CodeCode Available	2
RenderMe-360: A Large Digital Asset Library and Benchmarks Towards High-fidelity Head Avatars	May 22, 2023	2kImage Matting	CodeCode Available	2
DaGAN++: Depth-Aware Generative Adversarial Network for Talking Head Video Generation	May 10, 2023	3D geometryGenerative Adversarial Network	CodeCode Available	2
Emotionally Enhanced Talking Face Generation	Mar 21, 2023	Face GenerationTalking Face Generation	CodeCode Available	2
DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation	Jan 10, 2023	DenoisingTalking Head Generation	CodeCode Available	2
StyleTalk: One-shot Talking Head Generation with Controllable Speaking Styles	Jan 3, 2023	DecoderFace Generation	CodeCode Available	2
MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation	Dec 15, 2022	Face SwappingMeta-Learning	CodeCode Available	2
Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis	Jul 24, 2022	3D geometryNeRF	CodeCode Available	2
Depth-Aware Generative Adversarial Network for Talking Head Video Generation	Mar 13, 2022	3D geometryGenerative Adversarial Network	CodeCode Available	2
Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation	Sep 22, 2021	Image-to-Image TranslationTalking Face Generation	CodeCode Available	2
MakeItTalk: Speaker-Aware Talking-Head Animation	Apr 27, 2020	Talking Face GenerationTalking Head Generation	CodeCode Available	2
Advancing Talking Head Generation: A Comprehensive Survey of Multi-Modal Methodologies, Datasets, Evaluation Metrics, and Loss Functions	Jun 23, 2025	NeRFTalking Head Generation	CodeCode Available	1
Silence is Golden: Leveraging Adversarial Examples to Nullify Audio Control in LDM-based Talking-Head Generation	Jun 2, 2025	MisinformationTalking Head Generation	CodeCode Available	1
Joint Co-Speech Gesture and Expressive Talking Face Generation using Diffusion with Adapters	Dec 18, 2024	Face GenerationTalking Face Generation	CodeCode Available	1

Show:10 25 50

← PrevPage 1 of 5Next →

All datasets VoxCeleb2 - 1-shot learning VoxCeleb1 - 1-shot learning VoxCeleb1 - 32-shot learning VoxCeleb1 - 8-shot learning VoxCeleb2 - 8-shot learning 100 sleep nights of 8 caregivers VoxCeleb2 - 32-shot learning

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Few-shot Adversarial Model	FID	48.5	—	Unverified
2	CainGAN	FID	35	—	Unverified
3	Fast Bi-layer Avatars (medium size)	CSIM	0.65	—	Unverified
4	First Order Motion Model (medium size)	CSIM	0.64	—	Unverified
5	Few-shot Vid-to-vid (medium size)	CSIM	0.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	X2Face	FID	45.8	—	Unverified
2	Few-shot Adversarial Model	FID	43	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	X2Face	FID	56.5	—	Unverified
2	Few-shot Adversarial Model	FID	29.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	X2Face	FID	51.5	—	Unverified
2	Few-shot Adversarial Model	FID	38	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Few-shot Adversarial Model	FID	42.2	—	Unverified
2	CainGAN	FID	24.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Ashok	10%	12	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Few-shot Adversarial Model	FID	30.6	—	Unverified