| Location Aware Modular Biencoder for Tourism Question Answering | Jan 4, 2024 | Question Answering, Retrieval | Code Available | 0 | 5 |
| FloorNet: A Unified Framework for Floorplan Reconstruction from 3D Scans | Mar 31, 2018 | Spatial Reasoning, Vector Graphics | Code Available | 0 | 5 |
| Are LLMs the Master of All Trades? : Exploring Domain-Agnostic Reasoning Skills of LLMs | Mar 22, 2023 | All, Spatial Reasoning | Code Available | 0 | 5 |
| Can Vision Language Models Learn from Visual Demonstrations of Ambiguous Spatial Reasoning? | Sep 25, 2024 | In-Context Learning, Novel Concepts | Code Available | 0 | 5 |
| Knowing Earlier what Right Means to You: A Comprehensive VQA Dataset for Grounding Relative Directions via Multi-Task Learning | Jul 6, 2022 | Diagnostic, Multi-Task Learning | Code Available | 0 | 5 |
| Explicit Object Relation Alignment for Vision and Language Navigation | May 1, 2022 | Object, Relation | Code Available | 0 | 5 |
| Expand VSR Benchmark for VLLM to Expertize in Spatial Rules | Dec 24, 2024 | MME, Sensitivity | Code Available | 0 | 5 |
| Evaluation of Code LLMs on Geospatial Code Generation | Oct 6, 2024 | Code Generation, Spatial Reasoning | Code Available | 0 | 5 |
| Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors | May 30, 2025 | 3D geometry, Large Language Model | Code Available | 0 | 5 |
| Can Large Language Models Reason about the Region Connection Calculus? | Nov 29, 2024 | Spatial Reasoning | Code Available | 0 | 5 |
| SpaRC and SpaRP: Spatial Reasoning Characterization and Path Generation for Understanding Spatial Reasoning Capability of Large Language Models | Jun 7, 2024 | Spatial Reasoning | Code Available | 0 | 5 |
| In-the-wild Audio Spatialization with Flexible Text-guided Localization | Jun 1, 2025 | Spatial Reasoning | Code Available | 0 | 5 |
| Enhancing Logical Reasoning in Large Language Models through Graph-based Synthetic Data | Sep 19, 2024 | Logical Reasoning, Spatial Reasoning | Code Available | 0 | 5 |
| APT: Architectural Planning and Text-to-Blueprint Construction Using Large Language Models for Open-World Agents | Nov 26, 2024 | Few-Shot Learning, Large Language Model | Code Available | 0 | 5 |
| Encoding Spatial Relations from Natural Language | Jul 4, 2018 | Spatial Reasoning | Code Available | 0 | 5 |
| Enabling Systematic Generalization in Abstract Spatial Reasoning through Meta-Learning for Compositionality | Apr 2, 2025 | Meta-Learning, Spatial Reasoning | Code Available | 0 | 5 |
| Inherent limitations of LLMs regarding spatial information | Dec 5, 2023 | Spatial Reasoning | Code Available | 0 | 5 |
| Investigating Relational State Abstraction in Collaborative MARL | Dec 19, 2024 | Graph Neural Network, Multi-agent Reinforcement Learning | Code Available | 0 | 5 |
| Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMs | Apr 11, 2024 | Descriptive, Hallucination | Code Available | 0 | 5 |
| SPHERE: A Hierarchical Evaluation on Spatial Perception and Reasoning for Vision-Language Models | Dec 17, 2024 | Logical Reasoning, Spatial Reasoning | Code Available | 0 | 5 |
| EmbRACE-3K: Embodied Reasoning and Action in Complex Environments | Jul 14, 2025 | Scene Understanding, Spatial Reasoning | Unverified | 0 | 0 |
| ByDeWay: Boost Your multimodal LLM with DEpth prompting in a Training-Free Way | Jul 11, 2025 | Depth Estimation, Hallucination | Unverified | 0 | 0 |
| Embodied World Models Emerge from Navigational Task in Open-Ended Environments | Apr 15, 2025 | Meta Reinforcement Learning, Spatial Reasoning | Unverified | 0 | 0 |
| EmbodiedVSR: Dynamic Scene Graph-Guided Chain-of-Thought Reasoning for Visual Spatial Tasks | Mar 14, 2025 | Spatial Reasoning | Unverified | 0 | 0 |
| Bridging Visualization and Optimization: Multimodal Large Language Models on Graph-Structured Combinatorial Optimization | Jan 21, 2025 | Combinatorial Optimization, Sequential Decision Making | Unverified | 0 | 0 |