| Poly2Vec: Polymorphic Fourier-Based Encoding of Geospatial Objects for GeoAI Applications | Aug 27, 2024 | Spatial Reasoning | —Unverified | 0 | 0 |
| Preliminary Explorations with GPT-4o(mni) Native Image Generation | May 6, 2025 | Image Generationmultimodal generation | —Unverified | 0 | 0 |
| Proceedings of the 2nd Symposium on Problem-solving, Creativity and Spatial Reasoning in Cognitive Systems, ProSocrates 2017 | Jan 14, 2019 | Spatial Reasoning | —Unverified | 0 | 0 |
| PRS-Med: Position Reasoning Segmentation with Vision-Language Model in Medical Imaging | May 17, 2025 | Image SegmentationLanguage Modeling | —Unverified | 0 | 0 |
| Quantifying Geospatial in the Common Crawl Corpus | Jun 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| R2C: Mapping Room to Chessboard to Unlock LLM As Low-Level Action Planner | Jan 1, 2025 | Action GenerationGame of Chess | —Unverified | 0 | 0 |
| Reasoning Paths with Reference Objects Elicit Quantitative Spatial Reasoning in Large Vision-Language Models | Sep 15, 2024 | Spatial Reasoning | —Unverified | 0 | 0 |
| ReCLIP: A Strong Zero-Shot Baseline for Referring Expression Comprehension | Nov 16, 2021 | image-classificationImage Classification | —Unverified | 0 | 0 |
| ReGUIDE: Data Efficient GUI Grounding via Spatial Reasoning and Search | May 21, 2025 | Spatial Reasoning | —Unverified | 0 | 0 |
| Representation, Learning and Reasoning on Spatial Language for Downstream NLP Tasks | Nov 1, 2020 | Common Sense ReasoningQuestion Answering | —Unverified | 0 | 0 |
| ReSpace: Text-Driven 3D Scene Synthesis and Editing with Preference Alignment | Jun 3, 2025 | Indoor Scene SynthesisObject | —Unverified | 0 | 0 |
| Re-Thinking Inverse Graphics With Large Language Models | Apr 23, 2024 | Language ModellingLarge Language Model | —Unverified | 0 | 0 |
| RLS3: RL-Based Synthetic Sample Selection to Enhance Spatial Reasoning in Vision-Language Models for Indoor Autonomous Perception | Jan 31, 2025 | Reinforcement Learning (RL)Spatial Reasoning | —Unverified | 0 | 0 |
| RoboHop: Segment-based Topological Map Representation for Open-World Visual Navigation | May 9, 2024 | Natural Language QueriesRobot Navigation | —Unverified | 0 | 0 |
| RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for Robotics | Jun 15, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics | Jun 4, 2025 | Spatial Reasoning | —Unverified | 0 | 0 |
| RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics | Nov 25, 2024 | Robot ManipulationScene Understanding | —Unverified | 0 | 0 |
| ROCKET-2: Steering Visuomotor Policy via Cross-View Goal Alignment | Mar 4, 2025 | MinecraftSpatial Reasoning | —Unverified | 0 | 0 |
| RSRWKV: A Linear-Complexity 2D Attention Mechanism for Efficient Remote Sensing Vision Task | Mar 26, 2025 | Spatial Reasoning | —Unverified | 0 | 0 |
| SAVVY: Spatial Awareness via Audio-Visual LLMs through Seeing and Hearing | Jun 4, 2025 | Spatial Reasoning | —Unverified | 0 | 0 |
| Scaling RL to Long Videos | Jul 10, 2025 | Reinforcement Learning (RL)Spatial Reasoning | —Unverified | 0 | 0 |
| SceneGPT: A Language Model for 3D Scene Understanding | Aug 13, 2024 | In-Context LearningLanguage Modeling | —Unverified | 0 | 0 |
| SpatialPIN: Enhancing Spatial Reasoning Capabilities of Vision-Language Models through Prompting and Interacting 3D Priors | Mar 18, 2024 | HallucinationMotion Planning | —Unverified | 0 | 0 |
| SEM: Enhancing Spatial Understanding for Robust Robot Manipulation | May 22, 2025 | 3D geometryRobot Manipulation | —Unverified | 0 | 0 |
| ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models | Jun 26, 2025 | Spatial ReasoningVideo Generation | —Unverified | 0 | 0 |