| MEDTalk: Multimodal Controlled 3D Facial Animation with Dynamic Emotions by Disentangled Embedding | Jul 8, 2025 | Diversity, Talking Head Generation | Unverified | 0 |
| Advancing Talking Head Generation: A Comprehensive Survey of Multi-Modal Methodologies, Datasets, Evaluation Metrics, and Loss Functions | Jun 23, 2025 | NeRF, Talking Head Generation | Code Available | 1 |
| Silence is Golden: Leveraging Adversarial Examples to Nullify Audio Control in LDM-based Talking-Head Generation | Jun 2, 2025 | Misinformation, Talking Head Generation | Code Available | 1 |
| DualTalk: Dual-Speaker Interaction for 3D Talking Head Conversations | May 23, 2025 | Talking Head Generation | Unverified | 0 |
| KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution | May 1, 2025 | Talking Head Generation | Unverified | 0 |
| OmniTalker: Real-Time Text-Driven Talking Head Generation with In-Context Audio-Visual Style Replication | Apr 3, 2025 | Talking Head Generation, Video Synchronization | Unverified | 0 |
| Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation | Apr 3, 2025 | Mamba, Talking Head Generation | Code Available | 3 |
| Audio-Plane: Audio Factorization Plane Gaussian Splatting for Real-Time Talking Head Synthesis | Mar 28, 2025 | Computational Efficiency, Talking Head Generation | Unverified | 0 |
| Dual Audio-Centric Modality Coupling for Talking Head Generation | Mar 26, 2025 | NeRF, Talking Head Generation | Unverified | 0 |
| Perceptually Accurate 3D Talking Head Generation: New Definitions, Speech-Mesh Representation, and Evaluation Metrics | Mar 26, 2025 | Talking Head Generation | Unverified | 0 |
| Teller: Real-Time Streaming Audio-Driven Portrait Animation with Autoregressive Motion Generation | Mar 24, 2025 | Motion Generation, Portrait Animation | Unverified | 0 |
| InsTaG: Learning Personalized 3D Talking Head from Few-Second Video | Feb 27, 2025 | 3DGS, Talking Head Generation | Code Available | 2 |
| Dimitra: Audio-driven Diffusion model for Expressive Talking Head Generation | Feb 24, 2025 | Talking Head Generation | Unverified | 0 |
| VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control | Jan 2, 2025 | Talking Head Generation, Video Generation | Unverified | 0 |
| UniAvatar: Taming Lifelike Audio-Driven Talking Head Generation with Comprehensive Motion and Lighting Control | Dec 26, 2024 | Diversity, Talking Head Generation | Unverified | 0 |
| Joint Co-Speech Gesture and Expressive Talking Face Generation using Diffusion with Adapters | Dec 18, 2024 | Face Generation, Talking Face Generation | Code Available | 1 |
| VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization | Dec 13, 2024 | Face Generation, Motion Generation | Unverified | 0 |
| GoHD: Gaze-oriented and Highly Disentangled Portrait Animation with Rhythmic Poses and Realistic Expression | Dec 12, 2024 | Disentanglement, Portrait Animation | Code Available | 1 |
| IF-MDM: Implicit Face Motion Diffusion Model for High-Fidelity Realtime Talking Head Generation | Dec 5, 2024 | Disentanglement, Talking Head Generation | Unverified | 0 |
| EmotiveTalk: Expressive Talking Head Generation through Audio Information Decoupling and Emotional Video Diffusion | Nov 23, 2024 | Talking Head Generation | Unverified | 0 |
| ConsistentAvatar: Learning to Diffuse Fully Consistent Talking Head Avatar with Temporal Guidance | Nov 23, 2024 | Image Generation, single-image-generation | Unverified | 0 |
| Comparative Analysis of Audio Feature Extraction for Real-Time Talking Portrait Synthesis | Nov 20, 2024 | Talking Head Generation | Code Available | 1 |
| LES-Talker: Fine-Grained Emotion Editing for Talking Head Generation in Linear Emotion Space | Nov 14, 2024 | Talking Head Generation | Unverified | 0 |
| DAWN: Dynamic Frame Avatar with Non-autoregressive Diffusion Framework for Talking Head Video Generation | Oct 17, 2024 | Talking Head Generation, Video Generation | Code Available | 3 |
| Beyond Fixed Topologies: Unregistered Training and Comprehensive Evaluation Metrics for 3D Talking Heads | Oct 14, 2024 | Talking Head Generation | Code Available | 2 |
| EmoGene: Audio-Driven Emotional 3D Talking-Head Generation | Oct 7, 2024 | NeRF, Talking Head Generation | Unverified | 0 |
| LaDTalk: Latent Denoising for Synthesizing Talking Head Videos with High Frequency Details | Oct 1, 2024 | Denoising, Talking Head Generation | Unverified | 0 |
| Learning Frame-Wise Emotion Intensity for Audio-Driven Talking-Head Generation | Sep 29, 2024 | Talking Head Generation | Unverified | 0 |
| DreamHead: Learning Spatial-Temporal Correspondence via Hierarchical Diffusion for Audio-driven Talking Head Synthesis | Sep 16, 2024 | Talking Head Generation | Unverified | 0 |
| DiffTED: One-shot Audio-driven TED Talk Video Generation with Diffusion-based Co-speech Gestures | Sep 11, 2024 | Diversity, Talking Head Generation | Unverified | 0 |
| EMOdiffhead: Continuously Emotional Control in Talking Head Generation via Diffusion | Sep 11, 2024 | Portrait Animation, Talking Head Generation | Code Available | 0 |
| SVP: Style-Enhanced Vivid Portrait Talking Head Diffusion Model | Sep 5, 2024 | Diversity, Talking Head Generation | Unverified | 0 |
| PoseTalk: Text-and-Audio-based Pose Control and Motion Refinement for One-Shot Talking Head Generation | Sep 4, 2024 | Pose Prediction, Rhythm | Unverified | 0 |
| FD2Talk: Towards Generalized Talking Head Generation with Facial Decoupled Diffusion Model | Aug 18, 2024 | Talking Head Generation | Unverified | 0 |
| Landmark-guided Diffusion Model for High-fidelity and Temporally Coherent Talking Head Generation | Aug 3, 2024 | Denoising, Talking Head Generation | Unverified | 0 |
| GLDiTalker: Speech-Driven 3D Facial Animation with Graph Latent Diffusion Transformer | Aug 3, 2024 | Diversity, Talking Head Generation | Unverified | 0 |
| MultiTalk: Enhancing 3D Talking Head Generation Across Languages with Multilingual Video Dataset | Jun 20, 2024 | Talking Head Generation | Unverified | 0 |
| NLDF: Neural Light Dynamic Fields for Efficient 3D Talking Head Generation | Jun 17, 2024 | Knowledge Distillation, NeRF | Unverified | 0 |
| SPEAK: Speech-Driven Pose and Emotion-Adjustable Talking Head Generation | May 12, 2024 | Disentanglement, Face Generation | Unverified | 0 |
| NeRFFaceSpeech: One-shot Audio-driven 3D Talking Head Synthesis via Generative Prior | May 9, 2024 | Face Model, NeRF | Unverified | 0 |
| Embedded Representation Learning Network for Animating Styled Video Portrait | Apr 29, 2024 | NeRF, Representation Learning | Unverified | 0 |
| EDTalk: Efficient Disentanglement for Emotional Talking Head Synthesis | Apr 2, 2024 | Disentanglement, Talking Head Generation | Unverified | 0 |
| MoDiTalker: Motion-Disentangled Diffusion Model for High-Fidelity Talking Head Generation | Mar 28, 2024 | Talking Head Generation | Code Available | 2 |
| Adaptive Super Resolution For One-Shot Talking-Head Generation | Mar 23, 2024 | Decoder, Super-Resolution | Code Available | 2 |
| EmoVOCA: Speech-Driven Emotional 3D Talking Heads | Mar 19, 2024 | Talking Head Generation | Code Available | 1 |
| Style2Talker: High-Resolution Talking Head Generation with Emotion Style and Art Style | Mar 11, 2024 | Face Generation, Talking Face Generation | Unverified | 0 |
| A Comparative Study of Perceptual Quality Metrics for Audio-driven Talking Head Videos | Mar 11, 2024 | Talking Head Generation | Code Available | 1 |
| VectorTalker: SVG Talking Face Generation with Progressive Vectorisation | Dec 18, 2023 | Face Generation, Image Reconstruction | Unverified | 0 |
| AE-NeRF: Audio Enhanced Neural Radiance Field for Few Shot Talking Head Synthesis | Dec 18, 2023 | Face Generation, NeRF | Unverified | 0 |
| DreamTalk: When Emotional Talking Head Generation Meets Diffusion Probabilistic Models | Dec 15, 2023 | Denoising, Talking Head Generation | Code Available | 3 |