| Visuospatial Cognitive Assistant | May 18, 2025 | Spatial Reasoning | CodeCode Available | 1 |
| Towards Visuospatial Cognition via Hierarchical Fusion of Visual Experts | May 18, 2025 | Spatial Reasoning | CodeCode Available | 1 |
| PRS-Med: Position Reasoning Segmentation with Vision-Language Model in Medical Imaging | May 17, 2025 | Image SegmentationLanguage Modeling | —Unverified | 0 |
| Are Multimodal Large Language Models Ready for Omnidirectional Spatial Reasoning? | May 17, 2025 | HallucinationObject Counting | —Unverified | 0 |
| A Light and Smart Wearable Platform with Multimodal Foundation Model for Enhanced Spatial Reasoning in People with Blindness and Low Vision | May 16, 2025 | Large Language ModelNavigate | —Unverified | 0 |
| From Seeing to Doing: Bridging Reasoning and Decision for Robotic Manipulation | May 13, 2025 | Robot ManipulationSpatial Reasoning | CodeCode Available | 1 |
| Text-to-CadQuery: A New Paradigm for CAD Generation with Scalable Large Model Capabilities | May 10, 2025 | Spatial Reasoning | CodeCode Available | 2 |
| CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global Memory | May 8, 2025 | Large Language ModelNavigate | CodeCode Available | 1 |
| SpatialPrompting: Keyframe-driven Zero-Shot Spatial Reasoning with Off-the-Shelf Multimodal Large Language Models | May 8, 2025 | Spatial Reasoning | —Unverified | 0 |
| SITE: towards Spatial Intelligence Thorough Evaluation | May 8, 2025 | Question AnsweringSpatial Reasoning | —Unverified | 0 |