| ImplicitQA: Going beyond frames towards Implicit Video Reasoning | Jun 26, 2025 | Spatial Reasoning | CodeCode Available | 0 | 5 |
| CoSpace: Benchmarking Continuous Space Perception Ability for Vision-Language Models | Mar 18, 2025 | BenchmarkingSpatial Reasoning | CodeCode Available | 0 | 5 |
| SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models | Jun 3, 2024 | Language ModellingSpatial Reasoning | CodeCode Available | 0 | 5 |
| SPhyR: Spatial-Physical Reasoning Benchmark on Material Distribution | May 21, 2025 | Spatial Reasoning | CodeCode Available | 0 | 5 |
| SpaRC and SpaRP: Spatial Reasoning Characterization and Path Generation for Understanding Spatial Reasoning Capability of Large Language Models | Jun 7, 2024 | Spatial Reasoning | CodeCode Available | 0 | 5 |
| SpaceNLI: Evaluating the Consistency of Predicting Inferences in Space | Jul 5, 2023 | Natural Language InferenceNegation | CodeCode Available | 0 | 5 |
| Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation | Dec 7, 2023 | Spatial ReasoningText-to-Video Generation | CodeCode Available | 0 | 5 |
| SORNet: Spatial Object-Centric Representations for Sequential Manipulation | Sep 8, 2021 | ObjectRelation Classification | CodeCode Available | 0 | 5 |
| Guided Navigation from Multiple Viewpoints using Qualitative Spatial Reasoning | Nov 3, 2020 | Spatial Reasoning | CodeCode Available | 0 | 5 |
| Show, Don't Tell: Evaluating Large Language Models Beyond Textual Understanding with ChildPlay | Jul 12, 2024 | Spatial Reasoning | CodeCode Available | 0 | 5 |
| Grounding Spatial Relations in Text-Only Language Models | Mar 20, 2024 | Spatial Reasoning | CodeCode Available | 0 | 5 |
| Grounding Natural Language Instructions: Can Large Language Models Capture Spatial Information? | Sep 17, 2021 | Spatial Reasoning | CodeCode Available | 0 | 5 |
| Grounded Reinforcement Learning for Visual Reasoning | May 29, 2025 | reinforcement-learningReinforcement Learning | CodeCode Available | 0 | 5 |
| Grid-augmented vision: A simple yet effective approach for enhanced spatial understanding in multi-modal agents | Nov 27, 2024 | Autonomous NavigationObject Recognition | CodeCode Available | 0 | 5 |
| Representation Learning for Grounded Spatial Reasoning | Jul 13, 2017 | reinforcement-learningReinforcement Learning | CodeCode Available | 0 | 5 |
| Good at captioning, bad at counting: Benchmarking GPT-4V on Earth observation data | Jan 31, 2024 | BenchmarkingChange Detection | CodeCode Available | 0 | 5 |
| Reframing Spatial Reasoning Evaluation in Language Models: A Real-World Simulation Benchmark for Qualitative Reasoning | May 23, 2024 | Logical Reasoning Question AnsweringSpatial Reasoning | CodeCode Available | 0 | 5 |
| Scaling RL to Long Videos | Jul 10, 2025 | Reinforcement Learning (RL)Spatial Reasoning | CodeCode Available | 0 | 5 |
| Polymath: A Challenging Multi-modal Mathematical Reasoning Benchmark | Oct 6, 2024 | Mathematical ReasoningSpatial Reasoning | CodeCode Available | 0 | 5 |
| OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding | Jul 10, 2025 | Scene UnderstandingSpatial Reasoning | CodeCode Available | 0 | 5 |
| From Text to Space: Mapping Abstract Spatial Models in LLMs during a Grid-World Navigation Task | Feb 23, 2025 | Decision MakingNavigate | CodeCode Available | 0 | 5 |
| Neural Task Synthesis for Visual Programming | May 26, 2023 | Imitation LearningSpatial Reasoning | CodeCode Available | 0 | 5 |
| cilantro: A Lean, Versatile, and Efficient Library for Point Cloud Data Processing | Jul 1, 2018 | ClusteringPoint Cloud Segmentation | CodeCode Available | 0 | 5 |
| Narrowing the Gap between Vision and Action in Navigation | Aug 19, 2024 | DecoderSpatial Reasoning | CodeCode Available | 0 | 5 |
| Neuro-symbolic Training for Reasoning over Spatial Language | Jun 19, 2024 | Spatial ReasoningTransfer Learning | CodeCode Available | 0 | 5 |
| MM-GEN: Enhancing Task Performance Through Targeted Multimodal Data Curation | Jan 7, 2025 | Spatial Reasoning | CodeCode Available | 0 | 5 |
| No Blind Spots: Full-Surround Multi-Object Tracking for Autonomous Vehicles using Cameras & LiDARs | Feb 23, 2018 | Autonomous VehiclesMulti-Object Tracking | CodeCode Available | 0 | 5 |
| FoREST: Frame of Reference Evaluation in Spatial Reasoning Tasks | Feb 25, 2025 | Image GenerationLayout Generation | CodeCode Available | 0 | 5 |
| LOViS: Learning Orientation and Visual Signals for Vision and Language Navigation | Sep 26, 2022 | Spatial ReasoningVision and Language Navigation | CodeCode Available | 0 | 5 |
| Location-Aware Self-Supervised Transformers for Semantic Segmentation | Dec 5, 2022 | Contrastive Learningimage-classification | CodeCode Available | 0 | 5 |
| Location Aware Modular Biencoder for Tourism Question Answering | Jan 4, 2024 | Question AnsweringRetrieval | CodeCode Available | 0 | 5 |
| MapEval: A Map-Based Evaluation of Geo-Spatial Reasoning in Foundation Models | Dec 31, 2024 | Multiple-choiceQuestion Answering | CodeCode Available | 0 | 5 |
| FloorNet: A Unified Framework for Floorplan Reconstruction from 3D Scans | Mar 31, 2018 | Spatial ReasoningVector Graphics | CodeCode Available | 0 | 5 |
| Are LLMs the Master of All Trades? : Exploring Domain-Agnostic Reasoning Skills of LLMs | Mar 22, 2023 | AllSpatial Reasoning | CodeCode Available | 0 | 5 |
| Can Vision Language Models Learn from Visual Demonstrations of Ambiguous Spatial Reasoning? | Sep 25, 2024 | In-Context LearningNovel Concepts | CodeCode Available | 0 | 5 |
| Knowing Earlier what Right Means to You: A Comprehensive VQA Dataset for Grounding Relative Directions via Multi-Task Learning | Jul 6, 2022 | DiagnosticMulti-Task Learning | CodeCode Available | 0 | 5 |
| Explicit Object Relation Alignment for Vision and Language Navigation | May 1, 2022 | ObjectRelation | CodeCode Available | 0 | 5 |
| Expand VSR Benchmark for VLLM to Expertize in Spatial Rules | Dec 24, 2024 | MMESensitivity | CodeCode Available | 0 | 5 |
| Evaluation of Code LLMs on Geospatial Code Generation | Oct 6, 2024 | Code GenerationSpatial Reasoning | CodeCode Available | 0 | 5 |
| SPaRC: A Spatial Pathfinding Reasoning Challenge | May 22, 2025 | Spatial Reasoning | CodeCode Available | 0 | 5 |
| Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors | May 30, 2025 | 3D geometryLarge Language Model | CodeCode Available | 0 | 5 |
| Can Large Language Models Reason about the Region Connection Calculus? | Nov 29, 2024 | Spatial Reasoning | CodeCode Available | 0 | 5 |
| In-the-wild Audio Spatialization with Flexible Text-guided Localization | Jun 1, 2025 | Spatial Reasoning | CodeCode Available | 0 | 5 |
| Enhancing Logical Reasoning in Large Language Models through Graph-based Synthetic Data | Sep 19, 2024 | Logical ReasoningSpatial Reasoning | CodeCode Available | 0 | 5 |
| APT: Architectural Planning and Text-to-Blueprint Construction Using Large Language Models for Open-World Agents | Nov 26, 2024 | Few-Shot LearningLarge Language Model | CodeCode Available | 0 | 5 |
| Encoding Spatial Relations from Natural Language | Jul 4, 2018 | Spatial Reasoning | CodeCode Available | 0 | 5 |
| Enabling Systematic Generalization in Abstract Spatial Reasoning through Meta-Learning for Compositionality | Apr 2, 2025 | Meta-LearningSpatial Reasoning | CodeCode Available | 0 | 5 |
| Inherent limitations of LLMs regarding spatial information | Dec 5, 2023 | Spatial Reasoning | CodeCode Available | 0 | 5 |
| Investigating Relational State Abstraction in Collaborative MARL | Dec 19, 2024 | Graph Neural NetworkMulti-agent Reinforcement Learning | CodeCode Available | 0 | 5 |
| Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMs | Apr 11, 2024 | DescriptiveHallucination | CodeCode Available | 0 | 5 |