SOTAVerified

Talking Head Generation

Talking head generation is the task of generating a talking face from a set of images of a person.

( Image credit: Few-Shot Adversarial Learning of Realistic Neural Talking Head Models )

Papers

Showing 51100 of 119 papers

TitleStatusHype
Perceptually Accurate 3D Talking Head Generation: New Definitions, Speech-Mesh Representation, and Evaluation Metrics0
Dual Audio-Centric Modality Coupling for Talking Head Generation0
Teller: Real-Time Streaming Audio-Driven Portrait Animation with Autoregressive Motion Generation0
Dimitra: Audio-driven Diffusion model for Expressive Talking Head Generation0
VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control0
UniAvatar: Taming Lifelike Audio-Driven Talking Head Generation with Comprehensive Motion and Lighting Control0
VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization0
IF-MDM: Implicit Face Motion Diffusion Model for High-Fidelity Realtime Talking Head Generation0
ConsistentAvatar: Learning to Diffuse Fully Consistent Talking Head Avatar with Temporal Guidance0
EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion0
LES-Talker: Fine-Grained Emotion Editing for Talking Head Generation in Linear Emotion Space0
EmoGene: Audio-Driven Emotional 3D Talking-Head Generation0
LaDTalk: Latent Denoising for Synthesizing Talking Head Videos with High Frequency Details0
Learning Frame-Wise Emotion Intensity for Audio-Driven Talking-Head Generation0
DreamHead: Learning Spatial-Temporal Correspondence via Hierarchical Diffusion for Audio-driven Talking Head Synthesis0
EMOdiffhead: Continuously Emotional Control in Talking Head Generation via DiffusionCode0
DiffTED: One-shot Audio-driven TED Talk Video Generation with Diffusion-based Co-speech Gestures0
SVP: Style-Enhanced Vivid Portrait Talking Head Diffusion Model0
PoseTalk: Text-and-Audio-based Pose Control and Motion Refinement for One-Shot Talking Head Generation0
FD2Talk: Towards Generalized Talking Head Generation with Facial Decoupled Diffusion Model0
Landmark-guided Diffusion Model for High-fidelity and Temporally Coherent Talking Head Generation0
GLDiTalker: Speech-Driven 3D Facial Animation with Graph Latent Diffusion Transformer0
MultiTalk: Enhancing 3D Talking Head Generation Across Languages with Multilingual Video Dataset0
NLDF: Neural Light Dynamic Fields for Efficient 3D Talking Head Generation0
SPEAK: Speech-Driven Pose and Emotion-Adjustable Talking Head Generation0
NeRFFaceSpeech: One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior0
Embedded Representation Learning Network for Animating Styled Video Portrait0
EDTalk: Efficient Disentanglement for Emotional Talking Head Synthesis0
Style2Talker: High-Resolution Talking Head Generation with Emotion Style and Art Style0
VectorTalker: SVG Talking Face Generation with Progressive Vectorisation0
AE-NeRF: Audio Enhanced Neural Radiance Field for Few Shot Talking Head Synthesis0
PMMTalk: Speech-Driven 3D Facial Animation from Complementary Pseudo Multi-modal Features0
VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior0
LaughTalk: Expressive 3D Talking Head Generation with Laughter0
OSM-Net: One-to-Many One-shot Talking Head Generation with Spontaneous Head Motions0
RADIO: Reference-Agnostic Dubbing Video Synthesis0
From Pixels to Portraits: A Comprehensive Survey of Talking Head Generation Techniques and Applications0
Text-to-Video: a Two-stage Framework for Zero-shot Identity-agnostic Talking-head GenerationCode0
Hierarchical Semantic Perceptual Listener Head Video Generation: A High-performance Pipeline0
Interactive Conversational Head Generation0
A Comprehensive Multi-scale Approach for Speech and Dynamics Synchrony in Talking Head GenerationCode0
Emotional Talking Head Generation based on Memory-Sharing and Attention-Augmented Networks0
High-Fidelity and Freely Controllable Talking Head Video Generation0
One-Shot High-Fidelity Talking-Head Synthesis with Deformable Neural Radiance Field0
TalkCLIP: Talking Head Generation with Text-Guided Expressive Speaking Styles0
FONT: Flow-guided One-shot Talking Head Generation with Natural Head Motions0
OPT: One-shot Pose-Controllable Talking Head Generation0
Talking Head Generation with Probabilistic Audio-to-Visual Diffusion Priors0
Compressing Video Calls using Synthetic Talking Heads0
AutoLV: Automatic Lecture Video Generator0
Show:102550
← PrevPage 2 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Few-shot Adversarial ModelFID48.5Unverified
2CainGANFID35Unverified
3Fast Bi-layer Avatars (medium size)CSIM0.65Unverified
4First Order Motion Model (medium size)CSIM0.64Unverified
5Few-shot Vid-to-vid (medium size)CSIM0.6Unverified
#ModelMetricClaimedVerifiedStatus
1X2FaceFID45.8Unverified
2Few-shot Adversarial ModelFID43Unverified
#ModelMetricClaimedVerifiedStatus
1X2FaceFID56.5Unverified
2Few-shot Adversarial ModelFID29.5Unverified
#ModelMetricClaimedVerifiedStatus
1X2FaceFID51.5Unverified
2Few-shot Adversarial ModelFID38Unverified
#ModelMetricClaimedVerifiedStatus
1Few-shot Adversarial ModelFID42.2Unverified
2CainGANFID24.9Unverified
#ModelMetricClaimedVerifiedStatus
1Ashok10%12Unverified
#ModelMetricClaimedVerifiedStatus
1Few-shot Adversarial ModelFID30.6Unverified