SOTAVerified

Talking Head Generation

Talking head generation is the task of generating a talking face from a set of images of a person.

( Image credit: Few-Shot Adversarial Learning of Realistic Neural Talking Head Models )

Papers

Showing 125 of 119 papers

TitleStatusHype
MEDTalk: Multimodal Controlled 3D Facial Animation with Dynamic Emotions by Disentangled Embedding0
Advancing Talking Head Generation: A Comprehensive Survey of Multi-Modal Methodologies, Datasets, Evaluation Metrics, and Loss FunctionsCode1
Silence is Golden: Leveraging Adversarial Examples to Nullify Audio Control in LDM-based Talking-Head GenerationCode1
DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations0
KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution0
Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head GenerationCode3
OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication0
Audio-Plane: Audio Factorization Plane Gaussian Splatting for Real-Time Talking Head Synthesis0
Dual Audio-Centric Modality Coupling for Talking Head Generation0
Perceptually Accurate 3D Talking Head Generation: New Definitions, Speech-Mesh Representation, and Evaluation Metrics0
Teller: Real-Time Streaming Audio-Driven Portrait Animation with Autoregressive Motion Generation0
InsTaG: Learning Personalized 3D Talking Head from Few-Second VideoCode2
Dimitra: Audio-driven Diffusion model for Expressive Talking Head Generation0
VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control0
UniAvatar: Taming Lifelike Audio-Driven Talking Head Generation with Comprehensive Motion and Lighting Control0
Joint Co-Speech Gesture and Expressive Talking Face Generation using Diffusion with AdaptersCode1
VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization0
GoHD: Gaze-oriented and Highly Disentangled Portrait Animation with Rhythmic Poses and Realistic ExpressionCode1
IF-MDM: Implicit Face Motion Diffusion Model for High-Fidelity Realtime Talking Head Generation0
EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion0
ConsistentAvatar: Learning to Diffuse Fully Consistent Talking Head Avatar with Temporal Guidance0
Comparative Analysis of Audio Feature Extraction for Real-Time Talking Portrait SynthesisCode1
LES-Talker: Fine-Grained Emotion Editing for Talking Head Generation in Linear Emotion Space0
DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video GenerationCode3
Beyond Fixed Topologies: Unregistered Training and Comprehensive Evaluation Metrics for 3D Talking HeadsCode2
Show:102550
← PrevPage 1 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Few-shot Adversarial ModelFID48.5Unverified
2CainGANFID35Unverified
3Fast Bi-layer Avatars (medium size)CSIM0.65Unverified
4First Order Motion Model (medium size)CSIM0.64Unverified
5Few-shot Vid-to-vid (medium size)CSIM0.6Unverified
#ModelMetricClaimedVerifiedStatus
1X2FaceFID45.8Unverified
2Few-shot Adversarial ModelFID43Unverified
#ModelMetricClaimedVerifiedStatus
1X2FaceFID56.5Unverified
2Few-shot Adversarial ModelFID29.5Unverified
#ModelMetricClaimedVerifiedStatus
1X2FaceFID51.5Unverified
2Few-shot Adversarial ModelFID38Unverified
#ModelMetricClaimedVerifiedStatus
1Few-shot Adversarial ModelFID42.2Unverified
2CainGANFID24.9Unverified
#ModelMetricClaimedVerifiedStatus
1Ashok10%12Unverified
#ModelMetricClaimedVerifiedStatus
1Few-shot Adversarial ModelFID30.6Unverified