| DeepGesture: A conversational gesture synthesis system based on emotions and semantics | Jul 3, 2025 | Gesture GenerationMotion Synthesis | CodeCode Available | 0 |
| Intentional Gesture: Deliver Your Intentions with Gestures for Speech | May 21, 2025 | Gesture Generation | CodeCode Available | 1 |
| M3G: Multi-Granular Gesture Generator for Audio-Driven Full-Body Human Motion Synthesis | May 13, 2025 | Gesture GenerationMotion Synthesis | —Unverified | 0 |
| Inter-Diffusion Generation Model of Speakers and Listeners for Effective Communication | May 8, 2025 | DenoisingGesture Generation | —Unverified | 0 |
| Co^3Gesture: Towards Coherent Concurrent Co-speech 3D Gesture Generation with Interactive Diffusion | May 3, 2025 | Gesture Generation | —Unverified | 0 |
| EchoMask: Speech-Queried Attention-based Mask Modeling for Holistic Co-Speech Motion Generation | Apr 12, 2025 | Gesture GenerationMotion Generation | —Unverified | 0 |
| EasyGenNet: An Efficient Framework for Audio-Driven Gesture Video Generation Based on Diffusion Model | Apr 11, 2025 | Gesture GenerationVideo Generation | —Unverified | 0 |
| Audio-driven Gesture Generation via Deviation Feature in the Latent Space | Mar 27, 2025 | Gesture GenerationVideo Generation | —Unverified | 0 |
| SARGes: Semantically Aligned Reliable Gesture Generation via Intent Chain | Mar 26, 2025 | Gesture Generation | —Unverified | 0 |
| DIDiffGes: Decoupled Semi-Implicit Diffusion Models for Real-time Gesture Generation from Speech | Mar 21, 2025 | Gesture Generation | —Unverified | 0 |
| MAG: Multi-Modal Aligned Autoregressive Co-Speech Gesture Generation without Vector Quantization | Mar 18, 2025 | Gesture GenerationQuantization | —Unverified | 0 |
| Large Language Models for Virtual Human Gesture Selection | Mar 18, 2025 | Gesture Generation | —Unverified | 0 |
| Streaming Generation of Co-Speech Gestures via Accelerated Rolling Diffusion | Mar 13, 2025 | DiversityGesture Generation | —Unverified | 0 |
| HOP: Heterogeneous Topology-based Multimodal Entanglement for Co-Speech Gesture Generation | Mar 3, 2025 | Gesture GenerationRhythm | —Unverified | 0 |
| VarGes: Improving Variation in Co-Speech 3D Gesture Generation via StyleCLIPS | Feb 15, 2025 | 3D Human Pose EstimationDiversity | CodeCode Available | 0 |
| Contextual Gesture: Co-Speech Gesture Video Generation through Context-aware Gesture Representation | Feb 11, 2025 | Gesture GenerationVideo Generation | —Unverified | 0 |
| GestureLSM: Latent Shortcut based Co-Speech Gesture Generation with Spatial-Temporal Modeling | Jan 31, 2025 | DenoisingGesture Generation | CodeCode Available | 2 |
| EMO2: End-Effector Guided Audio-Driven Avatar Video Generation | Jan 18, 2025 | Gesture GenerationVideo Generation | —Unverified | 0 |
| Co-Speech Gesture Video Generation with Implicit Motion-Audio Entanglement | Jan 1, 2025 | Gesture GenerationMotion Generation | —Unverified | 0 |
| SemTalk: Holistic Co-speech Motion Generation with Frame-level Semantic Emphasis | Dec 21, 2024 | Gesture GenerationMotion Generation | —Unverified | 0 |
| The Language of Motion: Unifying Verbal and Non-verbal Language of 3D Human Motion | Dec 13, 2024 | DiversityGesture Generation | —Unverified | 0 |
| Retrieving Semantics from the Deep: an RAG Solution for Gesture Synthesis | Dec 9, 2024 | Gesture GenerationRAG | CodeCode Available | 2 |
| DiM-Gestor: Co-Speech Gesture Generation with Adaptive Layer Normalization Mamba-2 | Nov 23, 2024 | Gesture GenerationMamba | —Unverified | 0 |
| Conditional GAN for Enhancing Diffusion Models in Efficient and Authentic Global Gesture Generation from Audios | Oct 27, 2024 | DenoisingGesture Generation | —Unverified | 0 |
| Large Body Language Models | Oct 21, 2024 | Gesture GenerationLanguage Modeling | —Unverified | 0 |
| Emphasizing Semantic Consistency of Salient Posture for Speech-Driven Gesture Generation | Oct 17, 2024 | Gesture Generation | —Unverified | 0 |
| ExpGest: Expressive Speaker Generation Using Diffusion Model and Hybrid Audio-Text Guidance | Oct 12, 2024 | Gesture Generation | —Unverified | 0 |
| Towards a GENEA Leaderboard -- an Extended, Living Benchmark for Evaluating and Advancing Conversational Motion Synthesis | Oct 8, 2024 | Gesture GenerationMotion Synthesis | —Unverified | 0 |
| LLM Gesticulator: Leveraging Large Language Models for Scalable and Controllable Co-Speech Gesture Synthesis | Oct 6, 2024 | Gesture Generation | —Unverified | 0 |
| Enabling Synergistic Full-Body Control in Prompt-Based Co-Speech Motion Generation | Oct 1, 2024 | Gesture GenerationMotion Generation | CodeCode Available | 0 |
| MM-Conv: A Multi-modal Conversational Dataset for Virtual Humans | Sep 30, 2024 | Gesture Generation | —Unverified | 0 |
| 2D or not 2D: How Does the Dimensionality of Gesture Representation Affect 3D Co-Speech Gesture Generation? | Sep 16, 2024 | Gesture Generation | —Unverified | 0 |
| Incorporating Spatial Awareness in Data-Driven Gesture Generation for Virtual Agents | Aug 7, 2024 | Gesture Generation | —Unverified | 0 |
| MDT-A2G: Exploring Masked Diffusion Transformers for Co-Speech Gesture Generation | Aug 6, 2024 | DenoisingGesture Generation | —Unverified | 0 |
| DiM-Gesture: Co-Speech Gesture Generation with Adaptive Layer Normalization Mamba-2 framework | Aug 1, 2024 | Gesture GenerationMamba | —Unverified | 0 |
| MotionCraft: Crafting Whole-Body Motion with Plug-and-Play Multimodal Controls | Jul 30, 2024 | Gesture GenerationMotion Generation | CodeCode Available | 2 |
| Investigating the impact of 2D gesture representation on co-speech gesture generation | Jun 21, 2024 | 3D Pose EstimationGesture Generation | —Unverified | 0 |
| AMUSE: Emotional Speech-driven 3D Body Animation via Disentangled Latent Diffusion | Jun 1, 2024 | Gesture GenerationRhythm | CodeCode Available | 2 |
| CoCoGesture: Toward Coherent Co-speech 3D Gesture Generation in the Wild | May 27, 2024 | Gesture Generation | —Unverified | 0 |
| LLAniMAtion: LLAMA Driven Gesture Animation | May 13, 2024 | Gesture Generation | —Unverified | 0 |
| Bridge to Non-Barrier Communication: Gloss-Prompted Fine-grained Cued Speech Gesture Generation with Diffusion Model | Apr 30, 2024 | DescriptiveGesture Generation | —Unverified | 0 |
| ConvoFusion: Multi-Modal Conversational Diffusion for Co-Speech Gesture Synthesis | Mar 26, 2024 | Gesture Generation | CodeCode Available | 1 |
| Speech-driven Personalized Gesture Synthetics: Harnessing Automatic Fuzzy Feature Inference | Mar 16, 2024 | Gesture Generation | —Unverified | 0 |
| MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models | Mar 14, 2024 | 3D Face AnimationDiversity | CodeCode Available | 2 |
| DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation | Jan 9, 2024 | Computational EfficiencyGesture Generation | —Unverified | 0 |
| Freetalker: Controllable Speech and Text-Driven Gesture Generation Based on Diffusion Models for Enhanced Speaker Naturalness | Jan 7, 2024 | Gesture GenerationMotion Generation | —Unverified | 0 |
| EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Expressive Masked Audio Gesture Modeling | Dec 31, 2023 | 3D Face AnimationDiversity | CodeCode Available | 3 |
| Chain of Generation: Multi-Modal Gesture Synthesis via Cascaded Conditional Control | Dec 26, 2023 | Gesture GenerationRhythm | —Unverified | 0 |
| Emotional Speech-driven 3D Body Animation via Disentangled Latent Diffusion | Dec 7, 2023 | Gesture GenerationRhythm | CodeCode Available | 1 |
| Weakly-Supervised Emotion Transition Learning for Diverse 3D Co-speech Gesture Generation | Nov 29, 2023 | Audio inpaintingGesture Generation | —Unverified | 0 |