SOTAVerified

Talking Head Generation

Talking head generation is the task of generating a talking face from a set of images of a person.

(Image credit: Few-Shot Adversarial Learning of Realistic Neural Talking Head Models)

Papers

Showing 26-50 of 119 papers

Title | Status | Hype
GoHD: Gaze-oriented and Highly Disentangled Portrait Animation with Rhythmic Poses and Realistic Expression | Code | 1
Comparative Analysis of Audio Feature Extraction for Real-Time Talking Portrait Synthesis | Code | 1
EmoVOCA: Speech-Driven Emotional 3D Talking Heads | Code | 1
A Comparative Study of Perceptual Quality Metrics for Audio-driven Talking Head Videos | Code | 1
DiffDub: Person-generic Visual Dubbing Using Inpainting Renderer with Diffusion Auto-encoder | Code | 1
Learning Landmarks Motion from Speech for Speaker-Agnostic 3D Talking Heads Generation | Code | 1
Face Animation with an Attribute-Guided Diffusion Model | Code | 1
DisCoHead: Audio-and-Video-Driven Talking Head Generation by Disentangled Control of Head Pose and Facial Expressions | Code | 1
Autoregressive GAN for Semantic Unconditional Head Motion Generation | Code | 1
Perceptual Conversational Head Generation with Regularized Driver and Enhanced Renderer | Code | 1
AI-generated characters for supporting personalized learning and well-being | Code | 1
AnimeCeleb: Large-Scale Animation CelebHeads Dataset for Head Reenactment | Code | 1
Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion | Code | 1
Txt2Vid: Ultra-Low Bitrate Compression of Talking-Head Videos via Text | Code | 1
Write-a-speaker: Text-based Emotional and Rhythmic Talking-head Generation | Code | 1
Fast Bi-layer Neural Synthesis of One-Shot Realistic Head Avatars | Code | 1
MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation | Code | 1
Talking-head Generation with Rhythmic Head Motion | Code | 1
What comprises a good talking-head video generation?: A Survey and Benchmark | Code | 1
Text-based Editing of Talking-head Video | Code | 1
MEDTalk: Multimodal Controlled 3D Facial Animation with Dynamic Emotions by Disentangled Embedding | | 0
DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations | | 0
KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution | | 0
OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication | | 0
Audio-Plane: Audio Factorization Plane Gaussian Splatting for Real-Time Talking Head Synthesis | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Few-shot Adversarial Model | FID | 48.5 | | Unverified
2 | CainGAN | FID | 35 | | Unverified
3 | Fast Bi-layer Avatars (medium size) | CSIM | 0.65 | | Unverified
4 | First Order Motion Model (medium size) | CSIM | 0.64 | | Unverified
5 | Few-shot Vid-to-vid (medium size) | CSIM | 0.6 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | X2Face | FID | 45.8 | | Unverified
2 | Few-shot Adversarial Model | FID | 43 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | X2Face | FID | 56.5 | | Unverified
2 | Few-shot Adversarial Model | FID | 29.5 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | X2Face | FID | 51.5 | | Unverified
2 | Few-shot Adversarial Model | FID | 38 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Few-shot Adversarial Model | FID | 42.2 | | Unverified
2 | CainGAN | FID | 24.9 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Ashok | 10% | 12 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Few-shot Adversarial Model | FID | 30.6 | | Unverified