SOTAVerified

Talking Face Generation

Talking face generation aims to synthesize a sequence of face images that correspond to given speech semantics

( Image credit: Talking Face Generation by Adversarially Disentangled Audio-Visual Representation )

Papers

Showing 51100 of 110 papers

TitleStatusHype
Identity-Preserving Talking Face Generation with Landmark and Appearance PriorsCode2
High-fidelity Generalized Emotional Talking Face Generation with Multi-modal Emotion Space Learning0
Multimodal-driven Talking Face Generation via a Unified Diffusion-based Generator0
GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation0
Audio-Driven Talking Face Generation with Diverse yet Realistic Facial Animations0
That's What I Said: Fully-Controllable Talking Face Generation0
A Unified Compression Framework for Efficient Speech-Driven Talking-Face Generation0
DAE-Talker: High Fidelity Speech-Driven Talking Face Generation with Diffusion Autoencoder0
Seeing What You Said: Talking Face Generation Guided by a Lip Reading ExpertCode2
Emotionally Enhanced Talking Face GenerationCode2
DINet: Deformation Inpainting Network for Realistic Face Visually Dubbing on High Resolution VideoCode3
UniFLG: Unified Facial Landmark Generator from Text or Speech0
GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face SynthesisCode4
DPE: Disentanglement of Pose and Expression for General Video Portrait EditingCode2
Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation0
StyleTalk: One-shot Talking Head Generation with Controllable Speaking StylesCode2
EMMN: Emotional Motion Memory Network for Audio-driven Emotional Talking Face Generation0
LipFormer: High-Fidelity and Generalizable Talking Face Generation With a Pre-Learned Facial Codebook0
Emotional Talking Faces: Making Videos More Expressive and Realistic0
Memories are One-to-Many Mapping Alleviators in Talking Face Generation0
Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial DecompositionCode2
SyncTalkFace: Talking Face Generation with Precise Lip-Syncing via Audio-Lip Memory0
Taiwanese-Accented Mandarin and English Multi-Speaker Talking-Face Synthesis System0
FNeVR: Neural Volume Rendering for Face AnimationCode1
StableFace: Analyzing and Improving Motion Stability for Talking Face Generation0
Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head SynthesisCode2
Cut Inner Layers: A Structured Pruning Strategy for Efficient U-Net GANs0
Merkel Podcast Corpus: A Multimodal Dataset Compiled from 16 Years of Angela Merkel’s Weekly Video PodcastsCode1
EAMM: One-Shot Emotional Talking Face via Audio-Based Emotion-Aware Motion Model0
One-Shot Face Reenactment on Megapixels0
Merkel Podcast Corpus: A Multimodal Dataset Compiled from 16 Years of Angela Merkel's Weekly Video PodcastsCode1
Talking Face Generation with Multilingual TTS0
Emotion-Controllable Generalized Talking Face Generation0
An Audio-Visual Attention Based Multimodal Network for Fake Talking Face Videos Detection0
Attention-Based Lip Audio-Visual Synthesis for Talking Face Generation in the Wild0
StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pre-trained StyleGANCode2
One-shot Talking Face Generation from Single-speaker Audio-Visual Correlation Learning0
Intelligent Video Editing: Incorporating Modern Talking Face Generation Algorithms in a Video Editor0
Live Speech Portraits: Real-Time Photorealistic Talking-Head AnimationCode2
FACIAL: Synthesizing Dynamic Talking Face with Implicit Attribute LearningCode1
Parallel and High-Fidelity Text-to-Lip GenerationCode1
Txt2Vid: Ultra-Low Bitrate Compression of Talking-Head Videos via TextCode1
Flow-Guided One-Shot Talking Face Generation With a High-Resolution Audio-Visual DatasetCode1
Text2Video: Text-driven Talking-head Video Synthesis with Personalized Phoneme-Pose DictionaryCode1
3D-TalkEmo: Learning to Synthesize 3D Emotional Talking Head0
Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual RepresentationCode1
AD-NeRF: Audio Driven Neural Radiance Fields for Talking Head SynthesisCode0
Stochastic Talking Face Generation Using Latent Distribution MatchingCode0
A Lip Sync Expert Is All You Need for Speech to Lip Generation In The WildCode3
Speech Driven Talking Face Generation from a Single Image and an Emotion ConditionCode1
Show:102550
← PrevPage 2 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1EmoGenEmoAcc83.2Unverified
#ModelMetricClaimedVerifiedStatus
1LipGANLMD0.6Unverified