| Weakly Supervised Relative Spatial Reasoning for Visual Question Answering | Sep 4, 2021 | Question AnsweringSpatial Reasoning | CodeCode Available | 0 |
| Guided Navigation from Multiple Viewpoints using Qualitative Spatial Reasoning | Nov 3, 2020 | Spatial Reasoning | CodeCode Available | 0 |
| Grounding Spatial Relations in Text-Only Language Models | Mar 20, 2024 | Spatial Reasoning | CodeCode Available | 0 |
| Polymath: A Challenging Multi-modal Mathematical Reasoning Benchmark | Oct 6, 2024 | Mathematical ReasoningSpatial Reasoning | CodeCode Available | 0 |
| OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding | Jul 10, 2025 | Scene UnderstandingSpatial Reasoning | CodeCode Available | 0 |
| Grounding Natural Language Instructions: Can Large Language Models Capture Spatial Information? | Sep 17, 2021 | Spatial Reasoning | CodeCode Available | 0 |
| No Blind Spots: Full-Surround Multi-Object Tracking for Autonomous Vehicles using Cameras & LiDARs | Feb 23, 2018 | Autonomous VehiclesMulti-Object Tracking | CodeCode Available | 0 |
| Neuro-symbolic Training for Reasoning over Spatial Language | Jun 19, 2024 | Spatial ReasoningTransfer Learning | CodeCode Available | 0 |
| SORNet: Spatial Object-Centric Representations for Sequential Manipulation | Sep 8, 2021 | ObjectRelation Classification | CodeCode Available | 0 |
| Neural Task Synthesis for Visual Programming | May 26, 2023 | Imitation LearningSpatial Reasoning | CodeCode Available | 0 |
| SpaceNLI: Evaluating the Consistency of Predicting Inferences in Space | Jul 5, 2023 | Natural Language InferenceNegation | CodeCode Available | 0 |
| SpaRC and SpaRP: Spatial Reasoning Characterization and Path Generation for Understanding Spatial Reasoning Capability of Large Language Models | Jun 7, 2024 | Spatial Reasoning | CodeCode Available | 0 |
| SPaRC: A Spatial Pathfinding Reasoning Challenge | May 22, 2025 | Spatial Reasoning | CodeCode Available | 0 |
| Narrowing the Gap between Vision and Action in Navigation | Aug 19, 2024 | DecoderSpatial Reasoning | CodeCode Available | 0 |
| Grid-augmented vision: A simple yet effective approach for enhanced spatial understanding in multi-modal agents | Nov 27, 2024 | Autonomous NavigationObject Recognition | CodeCode Available | 0 |
| MM-GEN: Enhancing Task Performance Through Targeted Multimodal Data Curation | Jan 7, 2025 | Spatial Reasoning | CodeCode Available | 0 |
| MapEval: A Map-Based Evaluation of Geo-Spatial Reasoning in Foundation Models | Dec 31, 2024 | Multiple-choiceQuestion Answering | CodeCode Available | 0 |
| EgoHumans: An Egocentric 3D Multi-Human Benchmark | May 25, 2023 | 3D Pose EstimationHuman Detection | CodeCode Available | 0 |
| Good at captioning, bad at counting: Benchmarking GPT-4V on Earth observation data | Jan 31, 2024 | BenchmarkingChange Detection | CodeCode Available | 0 |
| LOViS: Learning Orientation and Visual Signals for Vision and Language Navigation | Sep 26, 2022 | Spatial ReasoningVision and Language Navigation | CodeCode Available | 0 |
| Disentangling Extraction and Reasoning in Multi-hop Spatial Reasoning | Oct 25, 2023 | Spatial Reasoning | CodeCode Available | 0 |
| Towards Ambiguity-Free Spatial Foundation Model: Rethinking and Decoupling Depth Ambiguity | Mar 8, 2025 | Depth EstimationScene Understanding | CodeCode Available | 0 |
| DepWiGNN: A Depth-wise Graph Neural Network for Multi-hop Spatial Reasoning in Text | Oct 19, 2023 | Graph Neural NetworkSpatial Reasoning | CodeCode Available | 0 |
| Can Large Language Models Reason about the Region Connection Calculus? | Nov 29, 2024 | Spatial Reasoning | CodeCode Available | 0 |
| Location Aware Modular Biencoder for Tourism Question Answering | Jan 4, 2024 | Question AnsweringRetrieval | CodeCode Available | 0 |