| MotionDiffuse: Text-Driven Human Motion Generation with Diffusion Model | Aug 31, 2022 | DenoisingMotion Generation | CodeCode Available | 2 |
| MotionCLIP: Exposing Human Motion Generation to CLIP Space | Mar 15, 2022 | DisentanglementMotion Generation | CodeCode Available | 2 |
| StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pre-trained StyleGAN | Mar 8, 2022 | Face GenerationFacial Editing | CodeCode Available | 2 |
| Freeform Body Motion Generation from Speech | Mar 4, 2022 | DiversityMotion Generation | CodeCode Available | 2 |
| SViMo: Synchronized Diffusion for Video and Motion Generation in Hand-object Interaction Scenarios | Jun 3, 2025 | Motion GenerationVideo Generation | CodeCode Available | 1 |
| EPFL-Smart-Kitchen-30: Densely annotated cooking dataset with 3D kinematics to challenge video and language models | Jun 2, 2025 | Action RecognitionAction Segmentation | CodeCode Available | 1 |
| MMGT: Motion Mask Guided Two-Stage Network for Co-Speech Gesture Video Generation | May 29, 2025 | Motion GenerationVideo Generation | CodeCode Available | 1 |
| Wav2Sem: Plug-and-Play Audio Semantic Decoupling for 3D Speech-Driven Facial Animation | May 29, 2025 | Motion Generation | CodeCode Available | 1 |
| AnyMoLe: Any Character Motion In-betweening Leveraging Video Diffusion Models | Mar 11, 2025 | Motion Generationmotion in-betweening | CodeCode Available | 1 |
| Light-T2M: A Lightweight and Fast Model for Text-to-motion Generation | Dec 15, 2024 | GPUMamba | CodeCode Available | 1 |