SOTAVerified

Talking Head Generation

Talking head generation is the task of generating a talking face from a set of images of a person.

(Image credit: Few-Shot Adversarial Learning of Realistic Neural Talking Head Models)

Papers

Showing 1-25 of 119 papers

| Title | Status | Hype |
| --- | --- | --- |
| SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation | Code | 6 |
| Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation | Code | 3 |
| DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation | Code | 3 |
| DreamTalk: When Emotional Talking Head Generation Meets Diffusion Probabilistic Models | Code | 3 |
| SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis | Code | 3 |
| A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild | Code | 3 |
| InsTaG: Learning Personalized 3D Talking Head from Few-Second Video | Code | 2 |
| Beyond Fixed Topologies: Unregistered Training and Comprehensive Evaluation Metrics for 3D Talking Heads | Code | 2 |
| MoDiTalker: Motion-Disentangled Diffusion Model for High-Fidelity Talking Head Generation | Code | 2 |
| Adaptive Super Resolution For One-Shot Talking-Head Generation | Code | 2 |
| Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation | Code | 2 |
| Implicit Identity Representation Conditioned Memory Compensation Network for Talking Head video Generation | Code | 2 |
| RenderMe-360: A Large Digital Asset Library and Benchmarks Towards High-fidelity Head Avatars | Code | 2 |
| DaGAN++: Depth-Aware Generative Adversarial Network for Talking Head Video Generation | Code | 2 |
| Emotionally Enhanced Talking Face Generation | Code | 2 |
| DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation | Code | 2 |
| StyleTalk: One-shot Talking Head Generation with Controllable Speaking Styles | Code | 2 |
| MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation | Code | 2 |
| Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis | Code | 2 |
| Depth-Aware Generative Adversarial Network for Talking Head Video Generation | Code | 2 |
| Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation | Code | 2 |
| MakeItTalk: Speaker-Aware Talking-Head Animation | Code | 2 |
| Advancing Talking Head Generation: A Comprehensive Survey of Multi-Modal Methodologies, Datasets, Evaluation Metrics, and Loss Functions | Code | 1 |
| Silence is Golden: Leveraging Adversarial Examples to Nullify Audio Control in LDM-based Talking-Head Generation | Code | 1 |
| Joint Co-Speech Gesture and Expressive Talking Face Generation using Diffusion with Adapters | Code | 1 |

Benchmark Results

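| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Few-shot Adversarial Model | FID | 48.5 | | Unverified |
| 2 | CainGAN | FID | 35 | | Unverified |
| 3 | Fast Bi-layer Avatars (medium size) | CSIM | 0.65 | | Unverified |
| 4 | First Order Motion Model (medium size) | CSIM | 0.64 | | Unverified |
| 5 | Few-shot Vid-to-vid (medium size) | CSIM | 0.6 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | X2Face | FID | 45.8 | | Unverified |
| 2 | Few-shot Adversarial Model | FID | 43 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | X2Face | FID | 56.5 | | Unverified |
| 2 | Few-shot Adversarial Model | FID | 29.5 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | X2Face | FID | 51.5 | | Unverified |
| 2 | Few-shot Adversarial Model | FID | 38 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Few-shot Adversarial Model | FID | 42.2 | | Unverified |
| 2 | CainGAN | FID | 24.9 | | Unverified |
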
| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Ashok | 10% | 12 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Few-shot Adversarial Model | FID | 30.6 | | Unverified |