SOTAVerified

Talking Face Generation

Talking face generation aims to synthesize a sequence of face images that correspond to given speech semantics

( Image credit: Talking Face Generation by Adversarially Disentangled Audio-Visual Representation )

Papers

Showing 91100 of 110 papers

TitleStatusHype
FlowVQTalker: High-Quality Emotional Talking Face Generation through Normalizing Flow and Quantization0
FT2TF: First-Person Statement Text-To-Talking Face Generation0
FTFDNet: Learning to Detect Talking Face Video Manipulation with Tri-Modality Interaction0
G4G:A Generic Framework for High Fidelity Talking Face Generation with Fine-grained Intra-modal Alignment0
GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation0
GLCF: A Global-Local Multimodal Coherence Analysis Framework for Talking Face Generation Detection0
GSTalker: Real-time Audio-Driven Talking Face Generation via Deformable Gaussian Splatting0
Hierarchical Cross-Modal Talking Face Generation With Dynamic Pixel-Wise Loss0
High-fidelity and Lip-synced Talking Face Synthesis via Landmark-based Diffusion Model0
High-fidelity Generalized Emotional Talking Face Generation with Multi-modal Emotion Space Learning0
Show:102550
← PrevPage 10 of 11Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1EmoGenEmoAcc83.2Unverified
#ModelMetricClaimedVerifiedStatus
1LipGANLMD0.6Unverified