| Unpaired Object-Level SAR-to-Optical Image Translation for Aircraft with Keypoints-Guided Diffusion Models | Mar 25, 2025 | Translation, Zero-shot Generalization | Unverified | 0 |
| FRESA: Feedforward Reconstruction of Personalized Skinned Avatars from Few Images | Mar 24, 2025 | 3D Canonicalization, Zero-shot Generalization | Code Available | 1 |
| Aether: Geometric-Aware Unified World Modeling | Mar 24, 2025 | Dynamic Reconstruction, Prediction | Unverified | 0 |
| Equivariant Image Modeling | Mar 24, 2025 | Image Generation, Zero-shot Generalization | Code Available | 1 |
| Enhancing Zero-Shot Image Recognition in Vision-Language Models through Human-like Concept Guidance | Mar 20, 2025 | Prompt Engineering, Zero-shot Generalization | Unverified | 0 |
| Bokehlicious: Photorealistic Bokeh Rendering with Controllable Apertures | Mar 20, 2025 | Deblurring, Zero-shot Generalization | Code Available | 2 |
| Jasmine: Harnessing Diffusion Prior for Self-supervised Depth Estimation | Mar 20, 2025 | Depth Estimation, Image Reconstruction | Unverified | 0 |
| STOP: Integrated Spatial-Temporal Dynamic Prompting for Video Understanding | Mar 20, 2025 | Video Understanding, Zero-shot Generalization | Code Available | 1 |
| Good Actions Succeed, Bad Actions Generalize: A Case Study on Why RL Generalizes Better | Mar 19, 2025 | Attribute, Reinforcement Learning (RL) | Unverified | 0 |
| GenM^3: Generative Pretrained Multi-path Motion Model for Text Conditional Human Motion Generation | Mar 19, 2025 | Large Language Model, Motion Generation | Unverified | 0 |