| Text-to-Video: a Two-stage Framework for Zero-shot Identity-agnostic Talking-head Generation | Aug 12, 2023 | Talking Head Generation, text-to-speech | Code Available | 0 | 5 |
| Speaker Independent and Multilingual/Mixlingual Speech-Driven Talking Head Generation Using Phonetic Posteriorgrams | Jun 20, 2020 | Talking Head Generation | —Unverified | 0 | 0 |
| Style2Talker: High-Resolution Talking Head Generation with Emotion Style and Art Style | Mar 11, 2024 | Face Generation, Talking Face Generation | —Unverified | 0 | 0 |
| StyleTalker: One-shot Style-based Audio-driven Talking Head Video Generation | Aug 23, 2022 | Talking Head Generation, Video Generation | —Unverified | 0 | 0 |
| SVP: Style-Enhanced Vivid Portrait Talking Head Diffusion Model | Sep 5, 2024 | Diversity, Talking Head Generation | —Unverified | 0 | 0 |
| Synthesizing Photorealistic Virtual Humans Through Cross-modal Disentanglement | Sep 3, 2022 | Data Augmentation, Disentanglement | —Unverified | 0 | 0 |
| TalkCLIP: Talking Head Generation with Text-Guided Expressive Speaking Styles | Apr 1, 2023 | 2D Semantic Segmentation task 3 (25 classes), Talking Head Generation | —Unverified | 0 | 0 |
| Talking Head Generation Driven by Speech-Related Facial Action Units and Audio- Based on Multimodal Representation Fusion | Apr 27, 2022 | Talking Head Generation | —Unverified | 0 | 0 |
| Talking Head Generation with Audio and Speech Related Facial Action Units | Oct 19, 2021 | Talking Head Generation | —Unverified | 0 | 0 |
| Talking Head Generation with Probabilistic Audio-to-Visual Diffusion Priors | Dec 7, 2022 | Talking Head Generation | —Unverified | 0 | 0 |
| Teller: Real-Time Streaming Audio-Driven Portrait Animation with Autoregressive Motion Generation | Mar 24, 2025 | Motion Generation, Portrait Animation | —Unverified | 0 | 0 |
| Towards Realistic Visual Dubbing with Heterogeneous Sources | Jan 17, 2022 | Disentanglement, Talking Head Generation | —Unverified | 0 | 0 |
| UniAvatar: Taming Lifelike Audio-Driven Talking Head Generation with Comprehensive Motion and Lighting Control | Dec 26, 2024 | Diversity, Talking Head Generation | —Unverified | 0 | 0 |
| VectorTalker: SVG Talking Face Generation with Progressive Vectorisation | Dec 18, 2023 | Face Generation, Image Reconstruction | —Unverified | 0 | 0 |
| VideoAnydoor: High-fidelity Video Object Insertion with Precise Motion Control | Jan 2, 2025 | Talking Head Generation, Video Generation | —Unverified | 0 | 0 |
| VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior | Dec 4, 2023 | Talking Head Generation | —Unverified | 0 | 0 |
| VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization | Dec 13, 2024 | Face Generation, Motion Generation | —Unverified | 0 | 0 |
| Landmark-guided Diffusion Model for High-fidelity and Temporally Coherent Talking Head Generation | Aug 3, 2024 | Denoising, Talking Head Generation | —Unverified | 0 | 0 |
| X2Face: A network for controlling face generation using images, audio, and pose codes | Sep 1, 2018 | Face Generation, Talking Head Generation | —Unverified | 0 | 0 |
| GLDiTalker: Speech-Driven 3D Facial Animation with Graph Latent Diffusion Transformer | Aug 3, 2024 | Diversity, Talking Head Generation | —Unverified | 0 | 0 |
| 3D-TalkEmo: Learning to Synthesize 3D Emotional Talking Head | Apr 25, 2021 | 3D Face Reconstruction, Face Reconstruction | —Unverified | 0 | 0 |
| AE-NeRF: Audio Enhanced Neural Radiance Field for Few Shot Talking Head Synthesis | Dec 18, 2023 | Face Generation, NeRF | —Unverified | 0 | 0 |
| Animating Face using Disentangled Audio Representations | Oct 2, 2019 | Representation Learning, Talking Head Generation | —Unverified | 0 | 0 |
| AnyoneNet: Synchronized Speech and Talking Head Generation for Arbitrary Person | Aug 9, 2021 | Talking Head Generation, text-to-speech | —Unverified | 0 | 0 |
| EmoGene: Audio-Driven Emotional 3D Talking-Head Generation | Oct 7, 2024 | NeRF, Talking Head Generation | —Unverified | 0 | 0 |