| Mem2Ego: Empowering Vision-Language Models with Global-to-Ego Memory for Long-Horizon Embodied Navigation | Feb 20, 2025 | Decision MakingEfficient Exploration | —Unverified | 0 | 0 |
| MindJourney: Test-Time Scaling with World Models for Spatial Reasoning | Jul 16, 2025 | Spatial Reasoning | —Unverified | 0 | 0 |
| MLLM-For3D: Adapting Multimodal Large Language Model for 3D Reasoning Segmentation | Mar 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| MMMR: Benchmarking Massive Multi-Modal Reasoning Tasks | May 22, 2025 | BenchmarkingSpatial Reasoning | —Unverified | 0 | 0 |
| MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence | May 29, 2025 | Multiple-choiceSpatial Reasoning | —Unverified | 0 | 0 |
| Morpho-logic from a Topos Perspective: Application to symbolic AI | Mar 8, 2023 | Spatial Reasoning | —Unverified | 0 | 0 |
| Multi-camera Bird's Eye View Perception for Autonomous Driving | Sep 16, 2023 | Autonomous DrivingSensor Fusion | —Unverified | 0 | 0 |
| Non-Monotonic Spatial Reasoning with Answer Set Programming Modulo Theories | Jun 25, 2016 | Spatial Reasoning | —Unverified | 0 | 0 |
| NuScenes-SpatialQA: A Spatial Understanding and Reasoning Benchmark for Vision-Language Models in Autonomous Driving | Apr 4, 2025 | 3d scene graph generationAutonomous Driving | —Unverified | 0 | 0 |
| Object Goal Navigation with Recursive Implicit Maps | Aug 10, 2023 | NavigateObject | —Unverified | 0 | 0 |
| OmniGeo: Towards a Multimodal Large Language Models for Geospatial Artificial Intelligence | Mar 20, 2025 | Instruction FollowingNatural Language Understanding | —Unverified | 0 | 0 |
| OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models | Jun 3, 2025 | Object CountingSpatial Reasoning | —Unverified | 0 | 0 |
| On Redundant Topological Constraints | Mar 3, 2014 | Spatial Reasoning | —Unverified | 0 | 0 |
| On the Internal Topological Structure of Plane Regions | Sep 1, 2009 | Spatial Reasoning | —Unverified | 0 | 0 |
| OpenD: A Benchmark for Language-Driven Door and Drawer Opening | Dec 10, 2022 | Spatial Reasoning | —Unverified | 0 | 0 |
| OpenSU3D: Open World 3D Scene Understanding using Foundation Models | Jul 19, 2024 | Scene UnderstandingSpatial Reasoning | —Unverified | 0 | 0 |
| Optimising Language Models for Downstream Tasks: A Post-Training Perspective | Jun 26, 2025 | parameter-efficient fine-tuningSpatial Reasoning | —Unverified | 0 | 0 |
| Out of Sight, Not Out of Context? Egocentric Spatial Reasoning in VLMs Across Disjoint Frames | May 30, 2025 | ObjectSpatial Reasoning | —Unverified | 0 | 0 |
| Part Localization using Multi-Proposal Consensus for Fine-Grained Categorization | Jul 22, 2015 | General ClassificationSpatial Reasoning | —Unverified | 0 | 0 |
| Path-of-Thoughts: Extracting and Following Paths for Robust Relational Reasoning with Large Language Models | Dec 23, 2024 | Relational ReasoningSpatial Reasoning | —Unverified | 0 | 0 |
| PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning | Jun 17, 2025 | General Reinforcement LearningMultimodal Reasoning | —Unverified | 0 | 0 |
| Coarse Correspondences Boost Spatial-Temporal Reasoning in Multimodal Language Model | Aug 1, 2024 | EgoSchemaLanguage Modeling | —Unverified | 0 | 0 |
| PhyBlock: A Progressive Benchmark for Physical Understanding and Planning via 3D Block Assembly | Jun 10, 2025 | Question AnsweringScene Understanding | —Unverified | 0 | 0 |
| PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs | Feb 12, 2024 | Instruction FollowingLogical Reasoning | —Unverified | 0 | 0 |
| Pix2Scene: Learning Implicit 3D Representations from Images | May 1, 2019 | Spatial Reasoning | —Unverified | 0 | 0 |