| Human Motion Prediction, Reconstruction, and Generation | Feb 21, 2025 | Human motion prediction, Human-Object Interaction Detection | Unverified | 0 |
| Humanoid-VLA: Towards Universal Humanoid Control with Visual Integration | Feb 20, 2025 | Data Augmentation, Humanoid Control | Unverified | 0 |
| ModSkill: Physical Character Skill Modularization | Feb 19, 2025 | Imitation Learning, Motion Generation | Unverified | 0 |
| Leader and Follower: Interactive Motion Generation under Trajectory Constraints | Feb 17, 2025 | Motion Generation | Unverified | 0 |
| Articulate That Object Part (ATOP): 3D Part Articulation from Text and Motion Personalization | Feb 11, 2025 | Image Generation, Motion Generation | Unverified | 0 |
| Dress-1-to-3: Single Image to Simulation-Ready 3D Outfit with Diffusion Prior and Differentiable Physics | Feb 5, 2025 | 3D Reconstruction, Image to 3D | Unverified | 0 |
| MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent | Feb 5, 2025 | Image to Video Generation, Motion Generation | Unverified | 0 |
| MotionLab: Unified Human Motion Generation and Editing via the Motion-Condition-Motion Paradigm | Feb 4, 2025 | Motion Generation, Multi-Task Learning | Unverified | 0 |
| VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models | Feb 4, 2025 | Motion Generation, motion prediction | Unverified | 0 |
| CASIM: Composite Aware Semantic Injection for Text to Motion Generation | Feb 4, 2025 | Motion Generation | Unverified | 0 |
| OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models | Feb 3, 2025 | Human Animation, Human-Object Interaction Detection | Unverified | 0 |
| Strong and Controllable 3D Motion Generation | Jan 30, 2025 | Motion Generation, Robot Manipulation | Unverified | 0 |
| Free-T2M: Frequency Enhanced Text-to-Motion Diffusion Model With Consistency Loss | Jan 30, 2025 | Denoising, Motion Generation | Code Available | 2 |
| FlexMotion: Lightweight, Physics-Aware, and Controllable Human Motion Generation | Jan 28, 2025 | Computational Efficiency, Decoder | Unverified | 0 |
| PackDiT: Joint Human Motion and Text Generation via Mutual Prompting | Jan 27, 2025 | Motion Generation, motion prediction | Unverified | 0 |
| KETA: Kinematic-Phrases-Enhanced Text-to-Motion Generation via Fine-grained Alignment | Jan 25, 2025 | Motion Generation, Motion Synthesis | Code Available | 0 |
| IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models | Jan 23, 2025 | Depth Estimation, Image Generation | Code Available | 0 |
| Toyteller: AI-powered Visual Storytelling Through Toy-Playing with Character Symbols | Jan 23, 2025 | Motion Generation, Text Generation | Unverified | 0 |
| Motion-X++: A Large-Scale Multimodal 3D Whole-body Human Motion Dataset | Jan 9, 2025 | Human Mesh Recovery, Motion Generation | Code Available | 4 |
| Constraints as Rewards: Reinforcement Learning for Robots without Reward Functions | Jan 8, 2025 | Motion Generation, reinforcement-learning | Unverified | 0 |
| JoyGen: Audio-Driven 3D Depth-Aware Talking-Face Video Editing | Jan 3, 2025 | 3D Reconstruction, Face Generation | Code Available | 3 |
| I2VGuard: Safeguarding Images against Misuse in Diffusion-based Image-to-Video Models | Jan 1, 2025 | Adversarial Attack, Image to Video Generation | Unverified | 0 |
| HSI-GPT: A General-Purpose Large Scene-Motion-Language Model for Human Scene Interaction | Jan 1, 2025 | Descriptive, Instruction Following | Unverified | 0 |
| InterAct: Advancing Large-Scale Versatile 3D Human-Object Interaction Generation | Jan 1, 2025 | Benchmarking, Human-Object Interaction Detection | Unverified | 0 |
| LatentHOI: On the Generalizable Hand Object Motion Generation with Latent Hand Diffusion. | Jan 1, 2025 | Motion Generation, Object | Unverified | 0 |