| From Text to Space: Mapping Abstract Spatial Models in LLMs during a Grid-World Navigation Task | Feb 23, 2025 | Decision MakingNavigate | CodeCode Available | 0 |
| FoREST: Frame of Reference Evaluation in Spatial Reasoning Tasks | Feb 25, 2025 | Image GenerationLayout Generation | CodeCode Available | 0 |
| Translating Place-Related Questions to GeoSPARQL Queries | May 6, 2022 | Geographic Question AnsweringQuestion Answering | CodeCode Available | 0 |
| DeepSSN: a deep convolutional neural network to assess spatial scene similarity | Feb 7, 2022 | Data AugmentationInformation Retrieval | CodeCode Available | 0 |
| APT: Architectural Planning and Text-to-Blueprint Construction Using Large Language Models for Open-World Agents | Nov 26, 2024 | Few-Shot LearningLarge Language Model | CodeCode Available | 0 |
| FloorNet: A Unified Framework for Floorplan Reconstruction from 3D Scans | Mar 31, 2018 | Spatial ReasoningVector Graphics | CodeCode Available | 0 |
| Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors | May 30, 2025 | 3D geometryLarge Language Model | CodeCode Available | 0 |
| Knowing Earlier what Right Means to You: A Comprehensive VQA Dataset for Grounding Relative Directions via Multi-Task Learning | Jul 6, 2022 | DiagnosticMulti-Task Learning | CodeCode Available | 0 |
| Bridging the Dynamic Perception Gap: Training-Free Draft Chain-of-Thought for Dynamic Multimodal Spatial Reasoning | May 22, 2025 | Spatial Reasoning | CodeCode Available | 0 |
| Explicit Object Relation Alignment for Vision and Language Navigation | May 1, 2022 | ObjectRelation | CodeCode Available | 0 |