| KETA: Kinematic-Phrases-Enhanced Text-to-Motion Generation via Fine-grained Alignment | Jan 25, 2025 | Motion Generation, Motion Synthesis | Code Available | 0 |
| IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models | Jan 23, 2025 | Depth Estimation, Image Generation | Code Available | 0 |
| Toyteller: AI-powered Visual Storytelling Through Toy-Playing with Character Symbols | Jan 23, 2025 | Motion Generation, Text Generation | Unverified | 0 |
| Motion-X++: A Large-Scale Multimodal 3D Whole-body Human Motion Dataset | Jan 9, 2025 | Human Mesh Recovery, Motion Generation | Code Available | 4 |
| Constraints as Rewards: Reinforcement Learning for Robots without Reward Functions | Jan 8, 2025 | Motion Generation, reinforcement-learning | Unverified | 0 |
| JoyGen: Audio-Driven 3D Depth-Aware Talking-Face Video Editing | Jan 3, 2025 | 3D Reconstruction, Face Generation | Code Available | 3 |
| I2VGuard: Safeguarding Images against Misuse in Diffusion-based Image-to-Video Models | Jan 1, 2025 | Adversarial Attack, Image to Video Generation | Unverified | 0 |
| HSI-GPT: A General-Purpose Large Scene-Motion-Language Model for Human Scene Interaction | Jan 1, 2025 | Descriptive, Instruction Following | Unverified | 0 |
| InterAct: Advancing Large-Scale Versatile 3D Human-Object Interaction Generation | Jan 1, 2025 | Benchmarking, Human-Object Interaction Detection | Unverified | 0 |
| LatentHOI: On the Generalizable Hand Object Motion Generation with Latent Hand Diffusion | Jan 1, 2025 | Motion Generation, Object | Unverified | 0 |