SOTAVerified

Talking Head Generation

Talking head generation is the task of generating a talking face from a set of images of a person.

( Image credit: Few-Shot Adversarial Learning of Realistic Neural Talking Head Models )

Papers

Showing 51100 of 119 papers

TitleStatusHype
PMMTalk: Speech-Driven 3D Facial Animation from Complementary Pseudo Multi-modal Features0
VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior0
SyncTalk: The Devil is in the Synchronization for Talking Head SynthesisCode3
DiffDub: Person-generic Visual Dubbing Using Inpainting Renderer with Diffusion Auto-encoderCode1
LaughTalk: Expressive 3D Talking Head Generation with Laughter0
OSM-Net: One-to-Many One-shot Talking Head Generation with Spontaneous Head Motions0
Efficient Emotional Adaptation for Audio-Driven Talking-Head GenerationCode2
RADIO: Reference-Agnostic Dubbing Video Synthesis0
From Pixels to Portraits: A Comprehensive Survey of Talking Head Generation Techniques and Applications0
Text-to-Video: a Two-stage Framework for Zero-shot Identity-agnostic Talking-head GenerationCode0
Implicit Identity Representation Conditioned Memory Compensation Network for Talking Head video GenerationCode2
Hierarchical Semantic Perceptual Listener Head Video Generation: A High-performance Pipeline0
Interactive Conversational Head Generation0
A Comprehensive Multi-scale Approach for Speech and Dynamics Synchrony in Talking Head GenerationCode0
Emotional Talking Head Generation based on Memory-Sharing and Attention-Augmented Networks0
Learning Landmarks Motion from Speech for Speaker-Agnostic 3D Talking Heads GenerationCode1
RenderMe-360: A Large Digital Asset Library and Benchmarks Towards High-fidelity Head AvatarsCode2
DaGAN++: Depth-Aware Generative Adversarial Network for Talking Head Video GenerationCode2
High-Fidelity and Freely Controllable Talking Head Video Generation0
One-Shot High-Fidelity Talking-Head Synthesis with Deformable Neural Radiance Field0
Face Animation with an Attribute-Guided Diffusion ModelCode1
TalkCLIP: Talking Head Generation with Text-Guided Expressive Speaking Styles0
FONT: Flow-guided One-shot Talking Head Generation with Natural Head Motions0
Emotionally Enhanced Talking Face GenerationCode2
DisCoHead: Audio-and-Video-Driven Talking Head Generation by Disentangled Control of Head Pose and Facial ExpressionsCode1
OPT: One-shot Pose-Controllable Talking Head Generation0
DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits AnimationCode2
StyleTalk: One-shot Talking Head Generation with Controllable Speaking StylesCode2
MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized AdaptationCode2
Talking Head Generation with Probabilistic Audio-to-Visual Diffusion Priors0
SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face AnimationCode6
Autoregressive GAN for Semantic Unconditional Head Motion GenerationCode1
Compressing Video Calls using Synthetic Talking Heads0
AutoLV: Automatic Lecture Video Generator0
Synthesizing Photorealistic Virtual Humans Through Cross-modal Disentanglement0
StyleTalker: One-shot Style-based Audio-driven Talking Head Video Generation0
Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head SynthesisCode2
Perceptual Conversational Head Generation with Regularized Driver and Enhanced RendererCode1
One-Shot Face Reenactment on Megapixels0
Talking Head Generation Driven by Speech-Related Facial Action Units and Audio- Based on Multimodal Representation Fusion0
DialogueNeRF: Towards Realistic Avatar Face-to-Face Conversation Video Generation0
Depth-Aware Generative Adversarial Network for Talking Head Video GenerationCode2
Towards Realistic Visual Dubbing with Heterogeneous Sources0
DFA-NeRF: Personalized Talking Head Generation via Disentangled Face Attributes Neural Rendering0
Expressive Talking Head Generation With Granular Audio-Visual Control0
Responsive Listening Head Generation: A Benchmark Dataset and Baseline0
AI-generated characters for supporting personalized learning and well-beingCode1
AnimeCeleb: Large-Scale Animation CelebHeads Dataset for Head ReenactmentCode1
Talking Head Generation with Audio and Speech Related Facial Action Units0
Live Speech Portraits: Real-Time Photorealistic Talking-Head AnimationCode2
Show:102550
← PrevPage 2 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Few-shot Adversarial ModelFID48.5Unverified
2CainGANFID35Unverified
3Fast Bi-layer Avatars (medium size)CSIM0.65Unverified
4First Order Motion Model (medium size)CSIM0.64Unverified
5Few-shot Vid-to-vid (medium size)CSIM0.6Unverified
#ModelMetricClaimedVerifiedStatus
1X2FaceFID45.8Unverified
2Few-shot Adversarial ModelFID43Unverified
#ModelMetricClaimedVerifiedStatus
1X2FaceFID56.5Unverified
2Few-shot Adversarial ModelFID29.5Unverified
#ModelMetricClaimedVerifiedStatus
1X2FaceFID51.5Unverified
2Few-shot Adversarial ModelFID38Unverified
#ModelMetricClaimedVerifiedStatus
1Few-shot Adversarial ModelFID42.2Unverified
2CainGANFID24.9Unverified
#ModelMetricClaimedVerifiedStatus
1Ashok10%12Unverified
#ModelMetricClaimedVerifiedStatus
1Few-shot Adversarial ModelFID30.6Unverified