| Pro2SAM: Mask Prompt to SAM with Grid Points for Weakly Supervised Object Localization | May 8, 2025 | Object Localization, Weakly-Supervised Object Localization | Unverified | 0 |
| TeDA: Boosting Vision-Lanuage Models for Zero-Shot 3D Object Retrieval via Testing-time Distribution Alignment | May 5, 2025 | 3D Object Retrieval, Language Modeling | Code Available | 0 |
| A Review of 3D Object Detection with Vision-Language Models | Apr 25, 2025 | 3D Object Detection, Object | Unverified | 0 |
| Text-to-Decision Agent: Learning Generalist Policies from Natural Language Supervision | Apr 21, 2025 | MuJoCo, Zero-shot Generalization | Unverified | 0 |
| Dysarthria Normalization via Local Lie Group Transformations for Robust ASR | Apr 16, 2025 | Robust Speech Recognition, speech-recognition | Code Available | 0 |
| Evolutionary Prompt Optimization Discovers Emergent Multimodal Reasoning Strategies in Vision-Language Models | Mar 30, 2025 | Image Segmentation, Language Modeling | Unverified | 0 |
| Zero-shot Domain Generalization of Foundational Models for 3D Medical Image Segmentation: An Experimental Study | Mar 28, 2025 | Domain Generalization, Image Segmentation | Unverified | 0 |
| Unpaired Object-Level SAR-to-Optical Image Translation for Aircraft with Keypoints-Guided Diffusion Models | Mar 25, 2025 | Translation, Zero-shot Generalization | Unverified | 0 |
| Thinking agents for zero-shot generalization to qualitatively novel tasks | Mar 25, 2025 | Zero-shot Generalization | Unverified | 0 |
| Aether: Geometric-Aware Unified World Modeling | Mar 24, 2025 | Dynamic Reconstruction, Prediction | Unverified | 0 |