| Mem2Ego: Empowering Vision-Language Models with Global-to-Ego Memory for Long-Horizon Embodied Navigation | Feb 20, 2025 | Decision MakingEfficient Exploration | —Unverified | 0 | 0 |
| MindJourney: Test-Time Scaling with World Models for Spatial Reasoning | Jul 16, 2025 | Spatial Reasoning | —Unverified | 0 | 0 |
| MLLM-For3D: Adapting Multimodal Large Language Model for 3D Reasoning Segmentation | Mar 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| MMMR: Benchmarking Massive Multi-Modal Reasoning Tasks | May 22, 2025 | BenchmarkingSpatial Reasoning | —Unverified | 0 | 0 |
| MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence | May 29, 2025 | Multiple-choiceSpatial Reasoning | —Unverified | 0 | 0 |
| Morpho-logic from a Topos Perspective: Application to symbolic AI | Mar 8, 2023 | Spatial Reasoning | —Unverified | 0 | 0 |
| Multi-camera Bird's Eye View Perception for Autonomous Driving | Sep 16, 2023 | Autonomous DrivingSensor Fusion | —Unverified | 0 | 0 |
| Non-Monotonic Spatial Reasoning with Answer Set Programming Modulo Theories | Jun 25, 2016 | Spatial Reasoning | —Unverified | 0 | 0 |
| NuScenes-SpatialQA: A Spatial Understanding and Reasoning Benchmark for Vision-Language Models in Autonomous Driving | Apr 4, 2025 | 3d scene graph generationAutonomous Driving | —Unverified | 0 | 0 |
| Object Goal Navigation with Recursive Implicit Maps | Aug 10, 2023 | NavigateObject | —Unverified | 0 | 0 |
| OmniGeo: Towards a Multimodal Large Language Models for Geospatial Artificial Intelligence | Mar 20, 2025 | Instruction FollowingNatural Language Understanding | —Unverified | 0 | 0 |
| OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models | Jun 3, 2025 | Object CountingSpatial Reasoning | —Unverified | 0 | 0 |
| On Redundant Topological Constraints | Mar 3, 2014 | Spatial Reasoning | —Unverified | 0 | 0 |
| On the Internal Topological Structure of Plane Regions | Sep 1, 2009 | Spatial Reasoning | —Unverified | 0 | 0 |
| OpenD: A Benchmark for Language-Driven Door and Drawer Opening | Dec 10, 2022 | Spatial Reasoning | —Unverified | 0 | 0 |
| OpenSU3D: Open World 3D Scene Understanding using Foundation Models | Jul 19, 2024 | Scene UnderstandingSpatial Reasoning | —Unverified | 0 | 0 |
| Optimising Language Models for Downstream Tasks: A Post-Training Perspective | Jun 26, 2025 | parameter-efficient fine-tuningSpatial Reasoning | —Unverified | 0 | 0 |
| Out of Sight, Not Out of Context? Egocentric Spatial Reasoning in VLMs Across Disjoint Frames | May 30, 2025 | ObjectSpatial Reasoning | —Unverified | 0 | 0 |
| Part Localization using Multi-Proposal Consensus for Fine-Grained Categorization | Jul 22, 2015 | General ClassificationSpatial Reasoning | —Unverified | 0 | 0 |
| Path-of-Thoughts: Extracting and Following Paths for Robust Relational Reasoning with Large Language Models | Dec 23, 2024 | Relational ReasoningSpatial Reasoning | —Unverified | 0 | 0 |
| PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning | Jun 17, 2025 | General Reinforcement LearningMultimodal Reasoning | —Unverified | 0 | 0 |
| Coarse Correspondences Boost Spatial-Temporal Reasoning in Multimodal Language Model | Aug 1, 2024 | EgoSchemaLanguage Modeling | —Unverified | 0 | 0 |
| PhyBlock: A Progressive Benchmark for Physical Understanding and Planning via 3D Block Assembly | Jun 10, 2025 | Question AnsweringScene Understanding | —Unverified | 0 | 0 |
| PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs | Feb 12, 2024 | Instruction FollowingLogical Reasoning | —Unverified | 0 | 0 |
| Pix2Scene: Learning Implicit 3D Representations from Images | May 1, 2019 | Spatial Reasoning | —Unverified | 0 | 0 |
| Poly2Vec: Polymorphic Fourier-Based Encoding of Geospatial Objects for GeoAI Applications | Aug 27, 2024 | Spatial Reasoning | —Unverified | 0 | 0 |
| Preliminary Explorations with GPT-4o(mni) Native Image Generation | May 6, 2025 | Image Generationmultimodal generation | —Unverified | 0 | 0 |
| Proceedings of the 2nd Symposium on Problem-solving, Creativity and Spatial Reasoning in Cognitive Systems, ProSocrates 2017 | Jan 14, 2019 | Spatial Reasoning | —Unverified | 0 | 0 |
| PRS-Med: Position Reasoning Segmentation with Vision-Language Model in Medical Imaging | May 17, 2025 | Image SegmentationLanguage Modeling | —Unverified | 0 | 0 |
| Quantifying Geospatial in the Common Crawl Corpus | Jun 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| R2C: Mapping Room to Chessboard to Unlock LLM As Low-Level Action Planner | Jan 1, 2025 | Action GenerationGame of Chess | —Unverified | 0 | 0 |
| Reasoning Paths with Reference Objects Elicit Quantitative Spatial Reasoning in Large Vision-Language Models | Sep 15, 2024 | Spatial Reasoning | —Unverified | 0 | 0 |
| ReCLIP: A Strong Zero-Shot Baseline for Referring Expression Comprehension | Nov 16, 2021 | image-classificationImage Classification | —Unverified | 0 | 0 |
| ReGUIDE: Data Efficient GUI Grounding via Spatial Reasoning and Search | May 21, 2025 | Spatial Reasoning | —Unverified | 0 | 0 |
| Representation, Learning and Reasoning on Spatial Language for Downstream NLP Tasks | Nov 1, 2020 | Common Sense ReasoningQuestion Answering | —Unverified | 0 | 0 |
| ReSpace: Text-Driven 3D Scene Synthesis and Editing with Preference Alignment | Jun 3, 2025 | Indoor Scene SynthesisObject | —Unverified | 0 | 0 |
| Re-Thinking Inverse Graphics With Large Language Models | Apr 23, 2024 | Language ModellingLarge Language Model | —Unverified | 0 | 0 |
| RLS3: RL-Based Synthetic Sample Selection to Enhance Spatial Reasoning in Vision-Language Models for Indoor Autonomous Perception | Jan 31, 2025 | Reinforcement Learning (RL)Spatial Reasoning | —Unverified | 0 | 0 |
| RoboHop: Segment-based Topological Map Representation for Open-World Visual Navigation | May 9, 2024 | Natural Language QueriesRobot Navigation | —Unverified | 0 | 0 |
| RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for Robotics | Jun 15, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics | Jun 4, 2025 | Spatial Reasoning | —Unverified | 0 | 0 |
| RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics | Nov 25, 2024 | Robot ManipulationScene Understanding | —Unverified | 0 | 0 |
| ROCKET-2: Steering Visuomotor Policy via Cross-View Goal Alignment | Mar 4, 2025 | MinecraftSpatial Reasoning | —Unverified | 0 | 0 |
| RSRWKV: A Linear-Complexity 2D Attention Mechanism for Efficient Remote Sensing Vision Task | Mar 26, 2025 | Spatial Reasoning | —Unverified | 0 | 0 |
| SAVVY: Spatial Awareness via Audio-Visual LLMs through Seeing and Hearing | Jun 4, 2025 | Spatial Reasoning | —Unverified | 0 | 0 |
| Scaling RL to Long Videos | Jul 10, 2025 | Reinforcement Learning (RL)Spatial Reasoning | —Unverified | 0 | 0 |
| SceneGPT: A Language Model for 3D Scene Understanding | Aug 13, 2024 | In-Context LearningLanguage Modeling | —Unverified | 0 | 0 |
| SpatialPIN: Enhancing Spatial Reasoning Capabilities of Vision-Language Models through Prompting and Interacting 3D Priors | Mar 18, 2024 | HallucinationMotion Planning | —Unverified | 0 | 0 |
| SEM: Enhancing Spatial Understanding for Robust Robot Manipulation | May 22, 2025 | 3D geometryRobot Manipulation | —Unverified | 0 | 0 |
| ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models | Jun 26, 2025 | Spatial ReasoningVideo Generation | —Unverified | 0 | 0 |