Talking Head Generation

Talking head generation is the task of generating a talking face from a set of images of a person.

(Image credit: Few-Shot Adversarial Learning of Realistic Neural Talking Head Models)

Papers


Showing 51–100 of 119 papers

Title	Date	Tasks	Status	Hype
PMMTalk: Speech-Driven 3D Facial Animation from Complementary Pseudo Multi-modal Features	Dec 5, 2023	cross-modal alignment, Decoder	Unverified	0
VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior	Dec 4, 2023	Talking Head Generation	Unverified	0
SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis	Nov 29, 2023	NeRF, Talking Face Generation	Code Available	3
DiffDub: Person-generic Visual Dubbing Using Inpainting Renderer with Diffusion Auto-encoder	Nov 3, 2023	Talking Face Generation, Talking Head Generation	Code Available	1
LaughTalk: Expressive 3D Talking Head Generation with Laughter	Nov 2, 2023	Talking Head Generation	Unverified	0
OSM-Net: One-to-Many One-shot Talking Head Generation with Spontaneous Head Motions	Sep 28, 2023	Talking Head Generation, Video Generation	Unverified	0
Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation	Sep 10, 2023	Talking Head Generation	Code Available	2
RADIO: Reference-Agnostic Dubbing Video Synthesis	Sep 5, 2023	Decoder, Talking Head Generation	Unverified	0
From Pixels to Portraits: A Comprehensive Survey of Talking Head Generation Techniques and Applications	Aug 30, 2023	NeRF, Survey	Unverified	0
Text-to-Video: a Two-stage Framework for Zero-shot Identity-agnostic Talking-head Generation	Aug 12, 2023	Talking Head Generation, text-to-speech	Code Available	0
Implicit Identity Representation Conditioned Memory Compensation Network for Talking Head video Generation	Jul 19, 2023	Talking Head Generation, Video Generation	Code Available	2
Hierarchical Semantic Perceptual Listener Head Video Generation: A High-performance Pipeline	Jul 19, 2023	Decoder, Talking Head Generation	Unverified	0
Interactive Conversational Head Generation	Jul 5, 2023	Sentence, Talking Head Generation	Unverified	0
A Comprehensive Multi-scale Approach for Speech and Dynamics Synchrony in Talking Head Generation	Jul 4, 2023	Talking Head Generation	Code Available	0
Emotional Talking Head Generation based on Memory-Sharing and Attention-Augmented Networks	Jun 6, 2023	Talking Head Generation	Unverified	0
Learning Landmarks Motion from Speech for Speaker-Agnostic 3D Talking Heads Generation	Jun 2, 2023	3D Face Animation, Talking Head Generation	Code Available	1
RenderMe-360: A Large Digital Asset Library and Benchmarks Towards High-fidelity Head Avatars	May 22, 2023	2k, Image Matting	Code Available	2
DaGAN++: Depth-Aware Generative Adversarial Network for Talking Head Video Generation	May 10, 2023	3D geometry, Generative Adversarial Network	Code Available	2
High-Fidelity and Freely Controllable Talking Head Video Generation	Apr 20, 2023	Face Model, Talking Head Generation	Unverified	0
One-Shot High-Fidelity Talking-Head Synthesis with Deformable Neural Radiance Field	Apr 11, 2023	NeRF, Neural Rendering	Unverified	0
Face Animation with an Attribute-Guided Diffusion Model	Apr 6, 2023	3D Face Reconstruction, Attribute	Code Available	1
TalkCLIP: Talking Head Generation with Text-Guided Expressive Speaking Styles	Apr 1, 2023	2D Semantic Segmentation task 3 (25 classes), Talking Head Generation	Unverified	0
FONT: Flow-guided One-shot Talking Head Generation with Natural Head Motions	Mar 31, 2023	Diversity, Pose Prediction	Unverified	0
Emotionally Enhanced Talking Face Generation	Mar 21, 2023	Face Generation, Talking Face Generation	Code Available	2
DisCoHead: Audio-and-Video-Driven Talking Head Generation by Disentangled Control of Head Pose and Facial Expressions	Mar 14, 2023	Talking Head Generation	Code Available	1
OPT: One-shot Pose-Controllable Talking Head Generation	Feb 16, 2023	Disentanglement, Talking Head Generation	Unverified	0
DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation	Jan 10, 2023	Denoising, Talking Head Generation	Code Available	2
StyleTalk: One-shot Talking Head Generation with Controllable Speaking Styles	Jan 3, 2023	Decoder, Face Generation	Code Available	2
MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation	Dec 15, 2022	Face Swapping, Meta-Learning	Code Available	2
Talking Head Generation with Probabilistic Audio-to-Visual Diffusion Priors	Dec 7, 2022	Talking Head Generation	Unverified	0
SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation	Nov 22, 2022	Image Animation, Talking Head Generation	Code Available	6
Autoregressive GAN for Semantic Unconditional Head Motion Generation	Nov 2, 2022	Motion Generation, Talking Head Generation	Code Available	1
Compressing Video Calls using Synthetic Talking Heads	Oct 7, 2022	Face Reenactment, Talking Head Generation	Unverified	0
AutoLV: Automatic Lecture Video Generator	Sep 19, 2022	Speech Synthesis, Talking Head Generation	Unverified	0
Synthesizing Photorealistic Virtual Humans Through Cross-modal Disentanglement	Sep 3, 2022	Data Augmentation, Disentanglement	Unverified	0
StyleTalker: One-shot Style-based Audio-driven Talking Head Video Generation	Aug 23, 2022	Talking Head Generation, Video Generation	Unverified	0
Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis	Jul 24, 2022	3D geometry, NeRF	Code Available	2
Perceptual Conversational Head Generation with Regularized Driver and Enhanced Renderer	Jun 26, 2022	Talking Head Generation	Code Available	1
One-Shot Face Reenactment on Megapixels	May 26, 2022	Face Generation, Face Reenactment	Unverified	0
Talking Head Generation Driven by Speech-Related Facial Action Units and Audio- Based on Multimodal Representation Fusion	Apr 27, 2022	Talking Head Generation	Unverified	0
DialogueNeRF: Towards Realistic Avatar Face-to-Face Conversation Video Generation	Mar 15, 2022	NeRF, Talking Head Generation	Unverified	0
Depth-Aware Generative Adversarial Network for Talking Head Video Generation	Mar 13, 2022	3D geometry, Generative Adversarial Network	Code Available	2
Towards Realistic Visual Dubbing with Heterogeneous Sources	Jan 17, 2022	Disentanglement, Talking Head Generation	Unverified	0
DFA-NeRF: Personalized Talking Head Generation via Disentangled Face Attributes Neural Rendering	Jan 3, 2022	NeRF, Neural Rendering	Unverified	0
Expressive Talking Head Generation With Granular Audio-Visual Control	Jan 1, 2022	Talking Head Generation	Unverified	0
Responsive Listening Head Generation: A Benchmark Dataset and Baseline	Dec 27, 2021	Talking Head Generation, Translation	Unverified	0
AI-generated characters for supporting personalized learning and well-being	Dec 15, 2021	Face Reenactment, Neural Rendering	Code Available	1
AnimeCeleb: Large-Scale Animation CelebHeads Dataset for Head Reenactment	Nov 15, 2021	Colorization, Face Reenactment	Code Available	1
Talking Head Generation with Audio and Speech Related Facial Action Units	Oct 19, 2021	Talking Head Generation	Unverified	0
Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation	Sep 22, 2021	Image-to-Image Translation, Talking Face Generation	Code Available	2


Datasets: VoxCeleb2 - 1-shot learning; VoxCeleb1 - 1-shot learning; VoxCeleb1 - 32-shot learning; VoxCeleb1 - 8-shot learning; VoxCeleb2 - 8-shot learning; 100 sleep nights of 8 caregivers; VoxCeleb2 - 32-shot learning

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Few-shot Adversarial Model	FID	48.5	—	Unverified
2	CainGAN	FID	35	—	Unverified
3	Fast Bi-layer Avatars (medium size)	CSIM	0.65	—	Unverified
4	First Order Motion Model (medium size)	CSIM	0.64	—	Unverified
5	Few-shot Vid-to-vid (medium size)	CSIM	0.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	X2Face	FID	45.8	—	Unverified
2	Few-shot Adversarial Model	FID	43	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	X2Face	FID	56.5	—	Unverified
2	Few-shot Adversarial Model	FID	29.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	X2Face	FID	51.5	—	Unverified
2	Few-shot Adversarial Model	FID	38	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Few-shot Adversarial Model	FID	42.2	—	Unverified
2	CainGAN	FID	24.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Ashok	10%	12	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Few-shot Adversarial Model	FID	30.6	—	Unverified