| Spatial Memory for Context Reasoning in Object Detection | Apr 13, 2017 | ObjectObject Detection | CodeCode Available | 0 |
| 3D CoCa: Contrastive Learners are 3D Captioners | Apr 13, 2025 | 3D dense captioningCaption Generation | CodeCode Available | 0 |
| Are LLMs the Master of All Trades? : Exploring Domain-Agnostic Reasoning Skills of LLMs | Mar 22, 2023 | AllSpatial Reasoning | CodeCode Available | 0 |
| Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMs | Apr 11, 2024 | DescriptiveHallucination | CodeCode Available | 0 |
| Dense 2D-3D Indoor Prediction with Sound via Aligned Cross-Modal Distillation | Sep 20, 2023 | 3D Scene ReconstructionDepth Estimation | CodeCode Available | 0 |
| From Text to Space: Mapping Abstract Spatial Models in LLMs during a Grid-World Navigation Task | Feb 23, 2025 | Decision MakingNavigate | CodeCode Available | 0 |
| FoREST: Frame of Reference Evaluation in Spatial Reasoning Tasks | Feb 25, 2025 | Image GenerationLayout Generation | CodeCode Available | 0 |
| Translating Place-Related Questions to GeoSPARQL Queries | May 6, 2022 | Geographic Question AnsweringQuestion Answering | CodeCode Available | 0 |
| DeepSSN: a deep convolutional neural network to assess spatial scene similarity | Feb 7, 2022 | Data AugmentationInformation Retrieval | CodeCode Available | 0 |
| APT: Architectural Planning and Text-to-Blueprint Construction Using Large Language Models for Open-World Agents | Nov 26, 2024 | Few-Shot LearningLarge Language Model | CodeCode Available | 0 |
| FloorNet: A Unified Framework for Floorplan Reconstruction from 3D Scans | Mar 31, 2018 | Spatial ReasoningVector Graphics | CodeCode Available | 0 |
| Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors | May 30, 2025 | 3D geometryLarge Language Model | CodeCode Available | 0 |
| Knowing Earlier what Right Means to You: A Comprehensive VQA Dataset for Grounding Relative Directions via Multi-Task Learning | Jul 6, 2022 | DiagnosticMulti-Task Learning | CodeCode Available | 0 |
| Bridging the Dynamic Perception Gap: Training-Free Draft Chain-of-Thought for Dynamic Multimodal Spatial Reasoning | May 22, 2025 | Spatial Reasoning | CodeCode Available | 0 |
| Explicit Object Relation Alignment for Vision and Language Navigation | May 1, 2022 | ObjectRelation | CodeCode Available | 0 |
| SPHERE: A Hierarchical Evaluation on Spatial Perception and Reasoning for Vision-Language Models | Dec 17, 2024 | Logical ReasoningSpatial Reasoning | CodeCode Available | 0 |
| SPhyR: Spatial-Physical Reasoning Benchmark on Material Distribution | May 21, 2025 | Spatial Reasoning | CodeCode Available | 0 |
| Expand VSR Benchmark for VLLM to Expertize in Spatial Rules | Dec 24, 2024 | MMESensitivity | CodeCode Available | 0 |
| CoSpace: Benchmarking Continuous Space Perception Ability for Vision-Language Models | Mar 18, 2025 | BenchmarkingSpatial Reasoning | CodeCode Available | 0 |
| Evaluation of Code LLMs on Geospatial Code Generation | Oct 6, 2024 | Code GenerationSpatial Reasoning | CodeCode Available | 0 |
| STAR-R1: Spacial TrAnsformation Reasoning by Reinforcing Multimodal LLMs | May 21, 2025 | Efficient ExplorationReinforcement Learning (RL) | CodeCode Available | 0 |
| Enhancing Logical Reasoning in Large Language Models through Graph-based Synthetic Data | Sep 19, 2024 | Logical ReasoningSpatial Reasoning | CodeCode Available | 0 |
| Investigating Relational State Abstraction in Collaborative MARL | Dec 19, 2024 | Graph Neural NetworkMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| Encoding Spatial Relations from Natural Language | Jul 4, 2018 | Spatial Reasoning | CodeCode Available | 0 |
| cilantro: A Lean, Versatile, and Efficient Library for Point Cloud Data Processing | Jul 1, 2018 | ClusteringPoint Cloud Segmentation | CodeCode Available | 0 |