| SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation | Nov 22, 2022 | Image AnimationTalking Head Generation | CodeCode Available | 6 |
| Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation | Apr 3, 2025 | MambaTalking Head Generation | CodeCode Available | 3 |
| DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation | Oct 17, 2024 | Talking Head GenerationVideo Generation | CodeCode Available | 3 |
| DreamTalk: When Emotional Talking Head Generation Meets Diffusion Probabilistic Models | Dec 15, 2023 | DenoisingTalking Head Generation | CodeCode Available | 3 |
| SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis | Nov 29, 2023 | NeRFTalking Face Generation | CodeCode Available | 3 |
| A Lip Sync Expert Is All You Need for Speech to Lip Generation In The Wild | Aug 23, 2020 | AllMORPH | CodeCode Available | 3 |
| InsTaG: Learning Personalized 3D Talking Head from Few-Second Video | Feb 27, 2025 | 3DGSTalking Head Generation | CodeCode Available | 2 |
| Beyond Fixed Topologies: Unregistered Training and Comprehensive Evaluation Metrics for 3D Talking Heads | Oct 14, 2024 | Talking Head Generation | CodeCode Available | 2 |
| MoDiTalker: Motion-Disentangled Diffusion Model for High-Fidelity Talking Head Generation | Mar 28, 2024 | Talking Head Generation | CodeCode Available | 2 |
| Adaptive Super Resolution For One-Shot Talking-Head Generation | Mar 23, 2024 | DecoderSuper-Resolution | CodeCode Available | 2 |
| Efficient Emotional Adaptation for Audio-Driven Talking-Head Generation | Sep 10, 2023 | Talking Head Generation | CodeCode Available | 2 |
| Implicit Identity Representation Conditioned Memory Compensation Network for Talking Head video Generation | Jul 19, 2023 | Talking Head GenerationVideo Generation | CodeCode Available | 2 |
| RenderMe-360: A Large Digital Asset Library and Benchmarks Towards High-fidelity Head Avatars | May 22, 2023 | 2kImage Matting | CodeCode Available | 2 |
| DaGAN++: Depth-Aware Generative Adversarial Network for Talking Head Video Generation | May 10, 2023 | 3D geometryGenerative Adversarial Network | CodeCode Available | 2 |
| Emotionally Enhanced Talking Face Generation | Mar 21, 2023 | Face GenerationTalking Face Generation | CodeCode Available | 2 |
| DiffTalk: Crafting Diffusion Models for Generalized Audio-Driven Portraits Animation | Jan 10, 2023 | DenoisingTalking Head Generation | CodeCode Available | 2 |
| StyleTalk: One-shot Talking Head Generation with Controllable Speaking Styles | Jan 3, 2023 | DecoderFace Generation | CodeCode Available | 2 |
| MetaPortrait: Identity-Preserving Talking Head Generation with Fast Personalized Adaptation | Dec 15, 2022 | Face SwappingMeta-Learning | CodeCode Available | 2 |
| Learning Dynamic Facial Radiance Fields for Few-Shot Talking Head Synthesis | Jul 24, 2022 | 3D geometryNeRF | CodeCode Available | 2 |
| Depth-Aware Generative Adversarial Network for Talking Head Video Generation | Mar 13, 2022 | 3D geometryGenerative Adversarial Network | CodeCode Available | 2 |
| Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation | Sep 22, 2021 | Image-to-Image TranslationTalking Face Generation | CodeCode Available | 2 |
| MakeItTalk: Speaker-Aware Talking-Head Animation | Apr 27, 2020 | Talking Face GenerationTalking Head Generation | CodeCode Available | 2 |
| Advancing Talking Head Generation: A Comprehensive Survey of Multi-Modal Methodologies, Datasets, Evaluation Metrics, and Loss Functions | Jun 23, 2025 | NeRFTalking Head Generation | CodeCode Available | 1 |
| Silence is Golden: Leveraging Adversarial Examples to Nullify Audio Control in LDM-based Talking-Head Generation | Jun 2, 2025 | MisinformationTalking Head Generation | CodeCode Available | 1 |
| Joint Co-Speech Gesture and Expressive Talking Face Generation using Diffusion with Adapters | Dec 18, 2024 | Face GenerationTalking Face Generation | CodeCode Available | 1 |
| GoHD: Gaze-oriented and Highly Disentangled Portrait Animation with Rhythmic Poses and Realistic Expression | Dec 12, 2024 | DisentanglementPortrait Animation | CodeCode Available | 1 |
| Comparative Analysis of Audio Feature Extraction for Real-Time Talking Portrait Synthesis | Nov 20, 2024 | Talking Head Generation | CodeCode Available | 1 |
| EmoVOCA: Speech-Driven Emotional 3D Talking Heads | Mar 19, 2024 | Talking Head Generation | CodeCode Available | 1 |
| A Comparative Study of Perceptual Quality Metrics for Audio-driven Talking Head Videos | Mar 11, 2024 | Talking Head Generation | CodeCode Available | 1 |
| DiffDub: Person-generic Visual Dubbing Using Inpainting Renderer with Diffusion Auto-encoder | Nov 3, 2023 | Talking Face GenerationTalking Head Generation | CodeCode Available | 1 |
| Learning Landmarks Motion from Speech for Speaker-Agnostic 3D Talking Heads Generation | Jun 2, 2023 | 3D Face AnimationTalking Head Generation | CodeCode Available | 1 |
| Face Animation with an Attribute-Guided Diffusion Model | Apr 6, 2023 | 3D Face ReconstructionAttribute | CodeCode Available | 1 |
| DisCoHead: Audio-and-Video-Driven Talking Head Generation by Disentangled Control of Head Pose and Facial Expressions | Mar 14, 2023 | Talking Head Generation | CodeCode Available | 1 |
| Autoregressive GAN for Semantic Unconditional Head Motion Generation | Nov 2, 2022 | Motion GenerationTalking Head Generation | CodeCode Available | 1 |
| Perceptual Conversational Head Generation with Regularized Driver and Enhanced Renderer | Jun 26, 2022 | Talking Head Generation | CodeCode Available | 1 |
| AI-generated characters for supporting personalized learning and well-being | Dec 15, 2021 | Face ReenactmentNeural Rendering | CodeCode Available | 1 |
| AnimeCeleb: Large-Scale Animation CelebHeads Dataset for Head Reenactment | Nov 15, 2021 | ColorizationFace Reenactment | CodeCode Available | 1 |
| Audio2Head: Audio-driven One-shot Talking-head Generation with Natural Head Motion | Jul 20, 2021 | Image GenerationTalking Head Generation | CodeCode Available | 1 |
| Txt2Vid: Ultra-Low Bitrate Compression of Talking-Head Videos via Text | Jun 26, 2021 | Talking Face GenerationTalking Head Generation | CodeCode Available | 1 |
| Write-a-speaker: Text-based Emotional and Rhythmic Talking-head Generation | Apr 16, 2021 | Face ModelRhythm | CodeCode Available | 1 |
| Fast Bi-layer Neural Synthesis of One-Shot Realistic Head Avatars | Aug 24, 2020 | Neural RenderingTalking Head Generation | CodeCode Available | 1 |
| MEAD: A Large-scale Audio-visual Dataset for Emotional Talking-face Generation | Aug 1, 2020 | Face GenerationTalking Face Generation | CodeCode Available | 1 |
| Talking-head Generation with Rhythmic Head Motion | Jul 16, 2020 | Talking Head Generation | CodeCode Available | 1 |
| What comprises a good talking-head video generation?: A Survey and Benchmark | May 7, 2020 | Talking Head GenerationVideo Generation | CodeCode Available | 1 |
| Text-based Editing of Talking-head Video | Jun 4, 2019 | Face ModelSentence | CodeCode Available | 1 |
| MEDTalk: Multimodal Controlled 3D Facial Animation with Dynamic Emotions by Disentangled Embedding | Jul 8, 2025 | DiversityTalking Head Generation | —Unverified | 0 |
| DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations | May 23, 2025 | Talking Head Generation | —Unverified | 0 |
| KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution | May 1, 2025 | Talking Head Generation | —Unverified | 0 |
| OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication | Apr 3, 2025 | Talking Head GenerationVideo Synchronization | —Unverified | 0 |
| Audio-Plane: Audio Factorization Plane Gaussian Splatting for Real-Time Talking Head Synthesis | Mar 28, 2025 | Computational EfficiencyTalking Head Generation | —Unverified | 0 |