| Structured Spatial Reasoning with Open Vocabulary Object Detectors | Oct 9, 2024 | ObjectObject Rearrangement | —Unverified | 0 |
| ST-VLM: Kinematic Instruction Tuning for Spatio-Temporal Reasoning in Vision-Language Models | Mar 25, 2025 | 4D reconstructionAutonomous Driving | —Unverified | 0 |
| Tag Map: A Text-Based Map for Spatial Reasoning and Navigation with Large Language Models | Sep 23, 2024 | Common Sense ReasoningSpatial Reasoning | —Unverified | 0 |
| Talking about the Moving Image: A Declarative Model for Image Schema Based Embodied Perception Grounding and Language Generation | Aug 13, 2015 | Spatial ReasoningText Generation | —Unverified | 0 |
| Testing GPT-4-o1-preview on math and science problems: A follow-up study | Oct 11, 2024 | MathSpatial Reasoning | —Unverified | 0 |
| TopV-Nav: Unlocking the Top-View Spatial Reasoning Potential of MLLM for Zero-shot Object Navigation | Nov 25, 2024 | Spatial Reasoning | —Unverified | 0 |
| Toward 3D Spatial Reasoning for Human-like Text-based Visual Question Answering | Sep 21, 2022 | Image CaptioningOptical Character Recognition (OCR) | —Unverified | 0 |
| Towards Dynamic 3D Reconstruction of Hand-Instrument Interaction in Ophthalmic Surgery | May 23, 2025 | 3D ReconstructionHand Pose Estimation | —Unverified | 0 |
| Towards Embodied Cognition in Robots via Spatially Grounded Synthetic Worlds | May 20, 2025 | Spatial Reasoning | —Unverified | 0 |
| Towards Grounded Visual Spatial Reasoning in Multi-Modal Vision Language Models | Aug 18, 2023 | Image-text matchingObject Localization | —Unverified | 0 |