| IMAGINE-E: Image Generation Intelligence Evaluation of State-of-the-art Text-to-Image Models | Jan 23, 2025 | Depth Estimation, Image Generation | Code Available | 0 |
| Constraints as Rewards: Reinforcement Learning for Robots without Reward Functions | Jan 8, 2025 | Motion Generation, reinforcement-learning | Unverified | 0 |
| POMP: Physics-consistent Motion Generative Model through Phase Manifolds | Jan 1, 2025 | Motion Generation, Unity | Unverified | 0 |
| Rethinking Diffusion for Text-Driven Human Motion Generation: Redundant Representations, Evaluation, and Masked Autoregression | Jan 1, 2025 | Motion Generation, Quantization | Unverified | 0 |
| IM-Zero: Instance-level Motion Controllable Video Generation in a Zero-shot Manner | Jan 1, 2025 | Motion Generation, Text-to-Video Generation | Unverified | 0 |
| I2VGuard: Safeguarding Images against Misuse in Diffusion-based Image-to-Video Models | Jan 1, 2025 | Adversarial Attack, Image to Video Generation | Unverified | 0 |
| AniMo: Species-Aware Model for Text-Driven Animal Motion Generation | Jan 1, 2025 | Motion Generation | Unverified | 0 |
| InterAct: Advancing Large-Scale Versatile 3D Human-Object Interaction Generation | Jan 1, 2025 | Benchmarking, Human-Object Interaction Detection | Unverified | 0 |
| Diffusion-based Realistic Listening Head Generation via Hybrid Motion Modeling | Jan 1, 2025 | Motion Generation, Video Generation | Unverified | 0 |
| HSI-GPT: A General-Purpose Large Scene-Motion-Language Model for Human Scene Interaction | Jan 1, 2025 | Descriptive, Instruction Following | Unverified | 0 |