| Humanoid-VLA: Towards Universal Humanoid Control with Visual Integration | Feb 20, 2025 | Data AugmentationHumanoid Control | —Unverified | 0 |
| ModSkill: Physical Character Skill Modularization | Feb 19, 2025 | Imitation LearningMotion Generation | —Unverified | 0 |
| Leader and Follower: Interactive Motion Generation under Trajectory Constraints | Feb 17, 2025 | Motion Generation | —Unverified | 0 |
| Articulate That Object Part (ATOP): 3D Part Articulation from Text and Motion Personalization | Feb 11, 2025 | Image GenerationMotion Generation | —Unverified | 0 |
| Dress-1-to-3: Single Image to Simulation-Ready 3D Outfit with Diffusion Prior and Differentiable Physics | Feb 5, 2025 | 3D ReconstructionImage to 3D | —Unverified | 0 |
| MotionAgent: Fine-grained Controllable Video Generation via Motion Field Agent | Feb 5, 2025 | Image to Video GenerationMotion Generation | —Unverified | 0 |
| CASIM: Composite Aware Semantic Injection for Text to Motion Generation | Feb 4, 2025 | Motion Generation | —Unverified | 0 |
| MotionLab: Unified Human Motion Generation and Editing via the Motion-Condition-Motion Paradigm | Feb 4, 2025 | Motion GenerationMulti-Task Learning | —Unverified | 0 |
| VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models | Feb 4, 2025 | Motion Generationmotion prediction | —Unverified | 0 |
| OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models | Feb 3, 2025 | Human AnimationHuman-Object Interaction Detection | —Unverified | 0 |
| Strong and Controllable 3D Motion Generation | Jan 30, 2025 | Motion GenerationRobot Manipulation | —Unverified | 0 |
| FlexMotion: Lightweight, Physics-Aware, and Controllable Human Motion Generation | Jan 28, 2025 | Computational EfficiencyDecoder | —Unverified | 0 |
| PackDiT: Joint Human Motion and Text Generation via Mutual Prompting | Jan 27, 2025 | Motion Generationmotion prediction | —Unverified | 0 |
| KETA: Kinematic-Phrases-Enhanced Text-to-Motion Generation via Fine-grained Alignment | Jan 25, 2025 | Motion GenerationMotion Synthesis | CodeCode Available | 0 |
| Toyteller: AI-powered Visual Storytelling Through Toy-Playing with Character Symbols | Jan 23, 2025 | Motion GenerationText Generation | —Unverified | 0 |
| IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models | Jan 23, 2025 | Depth EstimationImage Generation | CodeCode Available | 0 |
| Constraints as Rewards: Reinforcement Learning for Robots without Reward Functions | Jan 8, 2025 | Motion Generationreinforcement-learning | —Unverified | 0 |
| POMP: Physics-consistent Motion Generative Model through Phase Manifolds | Jan 1, 2025 | Motion GenerationUnity | —Unverified | 0 |
| Rethinking Diffusion for Text-Driven Human Motion Generation: Redundant Representations, Evaluation, and Masked Autoregression | Jan 1, 2025 | Motion GenerationQuantization | —Unverified | 0 |
| IM-Zero: Instance-level Motion Controllable Video Generation in a Zero-shot Manner | Jan 1, 2025 | Motion GenerationText-to-Video Generation | —Unverified | 0 |
| I2VGuard: Safeguarding Images against Misuse in Diffusion-based Image-to-Video Models | Jan 1, 2025 | Adversarial AttackImage to Video Generation | —Unverified | 0 |
| AniMo: Species-Aware Model for Text-Driven Animal Motion Generation | Jan 1, 2025 | Motion Generation | —Unverified | 0 |
| InterAct: Advancing Large-Scale Versatile 3D Human-Object Interaction Generation | Jan 1, 2025 | BenchmarkingHuman-Object Interaction Detection | —Unverified | 0 |
| Diffusion-based Realistic Listening Head Generation via Hybrid Motion Modeling | Jan 1, 2025 | Motion GenerationVideo Generation | —Unverified | 0 |
| HSI-GPT: A General-Purpose Large Scene-Motion-Language Model for Human Scene Interaction | Jan 1, 2025 | DescriptiveInstruction Following | —Unverified | 0 |