SOTAVerified

Talking Head Generation

Talking head generation is the task of generating a talking face from a set of images of a person.

( Image credit: Few-Shot Adversarial Learning of Realistic Neural Talking Head Models )

Papers

Showing 5175 of 119 papers

TitleStatusHype
Responsive Listening Head Generation: A Benchmark Dataset and Baseline0
Speaker Independent and Multilingual/Mixlingual Speech-Driven Talking Head Generation Using Phonetic Posteriorgrams0
Style2Talker: High-Resolution Talking Head Generation with Emotion Style and Art Style0
StyleTalker: One-shot Style-based Audio-driven Talking Head Video Generation0
SVP: Style-Enhanced Vivid Portrait Talking Head Diffusion Model0
Synthesizing Photorealistic Virtual Humans Through Cross-modal Disentanglement0
TalkCLIP: Talking Head Generation with Text-Guided Expressive Speaking Styles0
Talking Head Generation Driven by Speech-Related Facial Action Units and Audio- Based on Multimodal Representation Fusion0
Talking Head Generation with Audio and Speech Related Facial Action Units0
Talking Head Generation with Probabilistic Audio-to-Visual Diffusion Priors0
X2Face: A network for controlling face generation using images, audio, and pose codes0
GLDiTalker: Speech-Driven 3D Facial Animation with Graph Latent Diffusion Transformer0
3D-TalkEmo: Learning to Synthesize 3D Emotional Talking Head0
AE-NeRF: Audio Enhanced Neural Radiance Field for Few Shot Talking Head Synthesis0
Animating Face using Disentangled Audio Representations0
AnyoneNet: Synchronized Speech and Talking Head Generation for Arbitrary Person0
EmoGene: Audio-Driven Emotional 3D Talking-Head Generation0
Audio-Plane: Audio Factorization Plane Gaussian Splatting for Real-Time Talking Head Synthesis0
AutoLV: Automatic Lecture Video Generator0
Compressing Video Calls using Synthetic Talking Heads0
ConsistentAvatar: Learning to Diffuse Fully Consistent Talking Head Avatar with Temporal Guidance0
Deep Person Generation: A Survey from the Perspective of Face, Pose and Cloth Synthesis0
DFA-NeRF: Personalized Talking Head Generation via Disentangled Face Attributes Neural Rendering0
DialogueNeRF: Towards Realistic Avatar Face-to-Face Conversation Video Generation0
DiffTED: One-shot Audio-driven TED Talk Video Generation with Diffusion-based Co-speech Gestures0
Show:102550
← PrevPage 3 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Few-shot Adversarial ModelFID48.5Unverified
2CainGANFID35Unverified
3Fast Bi-layer Avatars (medium size)CSIM0.65Unverified
4First Order Motion Model (medium size)CSIM0.64Unverified
5Few-shot Vid-to-vid (medium size)CSIM0.6Unverified
#ModelMetricClaimedVerifiedStatus
1X2FaceFID45.8Unverified
2Few-shot Adversarial ModelFID43Unverified
#ModelMetricClaimedVerifiedStatus
1X2FaceFID56.5Unverified
2Few-shot Adversarial ModelFID29.5Unverified
#ModelMetricClaimedVerifiedStatus
1X2FaceFID51.5Unverified
2Few-shot Adversarial ModelFID38Unverified
#ModelMetricClaimedVerifiedStatus
1Few-shot Adversarial ModelFID42.2Unverified
2CainGANFID24.9Unverified
#ModelMetricClaimedVerifiedStatus
1Ashok10%12Unverified
#ModelMetricClaimedVerifiedStatus
1Few-shot Adversarial ModelFID30.6Unverified