SOTAVerified

Talking Head Generation

Talking head generation is the task of generating a talking face from a set of images of a person.

( Image credit: Few-Shot Adversarial Learning of Realistic Neural Talking Head Models )

Papers

Showing 2650 of 119 papers

TitleStatusHype
EmoGene: Audio-Driven Emotional 3D Talking-Head Generation0
LaDTalk: Latent Denoising for Synthesizing Talking Head Videos with High Frequency Details0
Learning Frame-Wise Emotion Intensity for Audio-Driven Talking-Head Generation0
DreamHead: Learning Spatial-Temporal Correspondence via Hierarchical Diffusion for Audio-driven Talking Head Synthesis0
DiffTED: One-shot Audio-driven TED Talk Video Generation with Diffusion-based Co-speech Gestures0
EMOdiffhead: Continuously Emotional Control in Talking Head Generation via DiffusionCode0
SVP: Style-Enhanced Vivid Portrait Talking Head Diffusion Model0
PoseTalk: Text-and-Audio-based Pose Control and Motion Refinement for One-Shot Talking Head Generation0
FD2Talk: Towards Generalized Talking Head Generation with Facial Decoupled Diffusion Model0
Landmark-guided Diffusion Model for High-fidelity and Temporally Coherent Talking Head Generation0
GLDiTalker: Speech-Driven 3D Facial Animation with Graph Latent Diffusion Transformer0
MultiTalk: Enhancing 3D Talking Head Generation Across Languages with Multilingual Video Dataset0
NLDF: Neural Light Dynamic Fields for Efficient 3D Talking Head Generation0
SPEAK: Speech-Driven Pose and Emotion-Adjustable Talking Head Generation0
NeRFFaceSpeech: One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior0
Embedded Representation Learning Network for Animating Styled Video Portrait0
EDTalk: Efficient Disentanglement for Emotional Talking Head Synthesis0
MoDiTalker: Motion-Disentangled Diffusion Model for High-Fidelity Talking Head GenerationCode2
Adaptive Super Resolution For One-Shot Talking-Head GenerationCode2
EmoVOCA: Speech-Driven Emotional 3D Talking HeadsCode1
Style2Talker: High-Resolution Talking Head Generation with Emotion Style and Art Style0
A Comparative Study of Perceptual Quality Metrics for Audio-driven Talking Head VideosCode1
VectorTalker: SVG Talking Face Generation with Progressive Vectorisation0
AE-NeRF: Audio Enhanced Neural Radiance Field for Few Shot Talking Head Synthesis0
DreamTalk: When Emotional Talking Head Generation Meets Diffusion Probabilistic ModelsCode3
Show:102550
← PrevPage 2 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Few-shot Adversarial ModelFID48.5Unverified
2CainGANFID35Unverified
3Fast Bi-layer Avatars (medium size)CSIM0.65Unverified
4First Order Motion Model (medium size)CSIM0.64Unverified
5Few-shot Vid-to-vid (medium size)CSIM0.6Unverified
#ModelMetricClaimedVerifiedStatus
1X2FaceFID45.8Unverified
2Few-shot Adversarial ModelFID43Unverified
#ModelMetricClaimedVerifiedStatus
1X2FaceFID56.5Unverified
2Few-shot Adversarial ModelFID29.5Unverified
#ModelMetricClaimedVerifiedStatus
1X2FaceFID51.5Unverified
2Few-shot Adversarial ModelFID38Unverified
#ModelMetricClaimedVerifiedStatus
1Few-shot Adversarial ModelFID42.2Unverified
2CainGANFID24.9Unverified
#ModelMetricClaimedVerifiedStatus
1Ashok10%12Unverified
#ModelMetricClaimedVerifiedStatus
1Few-shot Adversarial ModelFID30.6Unverified