| MEgoHand: Multimodal Egocentric Hand-Object Interaction Motion Generation | May 22, 2025 | Motion Generation, Object | —Unverified | 0 |
| GSR-BENCH: A Benchmark for Grounded Spatial Reasoning Evaluation via Multimodal LLMs | Jun 19, 2024 | Spatial Reasoning, Visual Reasoning | —Unverified | 0 |
| Integrating Symbolic Reasoning into Neural Generative Models for Design Generation | Oct 13, 2023 | Spatial Reasoning | —Unverified | 0 |
| Intelligence of Things: A Spatial Context-Aware Control System for Smart Devices | Apr 16, 2025 | Spatial Reasoning | —Unverified | 0 |
| A Survey for Foundation Models in Autonomous Driving | Feb 2, 2024 | 3D Object Detection, Autonomous Driving | —Unverified | 0 |
| A LLM Benchmark based on the Minecraft Builder Dialog Agent Task | Jul 17, 2024 | Math, Minecraft | —Unverified | 0 |
| Mathematical Definition and Systematization of Puzzle Rules | Dec 18, 2024 | Game Design, Spatial Reasoning | —Unverified | 0 |
| Grounded Reinforcement Learning for Visual Reasoning | May 29, 2025 | reinforcement-learning, Reinforcement Learning | —Unverified | 0 |
| Complexity Classification in Infinite-Domain Constraint Satisfaction | Jan 4, 2012 | Classification, General Classification | —Unverified | 0 |
| Commonsense Visual Sensemaking for Autonomous Driving: On Generalised Neurosymbolic Online Abduction Integrating Vision and Semantics | Dec 28, 2020 | Autonomous Driving, Question Answering | —Unverified | 0 |
| Jigsaw-Puzzles: From Seeing to Understanding to Reasoning in Vision-Language Models | May 27, 2025 | Diagnostic, Spatial Reasoning | —Unverified | 0 |
| A Surprising Failure? Multimodal LLMs and the NLVR Challenge | Feb 26, 2024 | Sentence, Spatial Reasoning | —Unverified | 0 |
| JSTR: Joint Spatio-Temporal Reasoning for Event-based Moving Object Detection | Mar 12, 2024 | Motion Compensation, Moving Object Detection | —Unverified | 0 |
| MEBench: A Novel Benchmark for Understanding Mutual Exclusivity Bias in Vision-Language Models | May 26, 2025 | Spatial Reasoning | —Unverified | 0 |
| Mem2Ego: Empowering Vision-Language Models with Global-to-Ego Memory for Long-Horizon Embodied Navigation | Feb 20, 2025 | Decision Making, Efficient Exploration | —Unverified | 0 |
| LABNet: Local Graph Aggregation Network with Class Balanced Loss for Vehicle Re-Identification | Nov 29, 2020 | Spatial Reasoning, Vehicle Re-Identification | —Unverified | 0 |
| LanguageRefer: Spatial-Language Model for 3D Visual Grounding | Jul 7, 2021 | 3D visual grounding, Language Modeling | —Unverified | 0 |
| Large Language-Geometry Model: When LLM meets Equivariance | Feb 16, 2025 | model, Spatial Reasoning | —Unverified | 0 |
| Large Language Models and Mathematical Reasoning Failures | Feb 17, 2025 | Mathematical Reasoning, Physical Intuition | —Unverified | 0 |
| GRASP: A Grid-Based Benchmark for Evaluating Commonsense Spatial Reasoning | Jul 2, 2024 | Spatial Reasoning | —Unverified | 0 |
| Graph Relation Transformer: Incorporating pairwise object features into the Transformer architecture | Nov 11, 2021 | Graph Attention, Question Answering | —Unverified | 0 |
| Commonsense Spatial Reasoning for Visually Intelligent Agents | Apr 1, 2021 | Spatial Reasoning | —Unverified | 0 |
| GPT-4o System Card | Oct 25, 2024 | Multiple-choice, Spatial Reasoning | —Unverified | 0 |
| Combining Deep Learning and Qualitative Spatial Reasoning to Learn Complex Structures from Sparse Examples with Noise | Nov 27, 2018 | AI Agent, Heuristic Search | —Unverified | 0 |
| A Spoken Dialogue System for Spatial Question Answering in a Physical Blocks World | Nov 6, 2019 | Natural Language Understanding, Question Answering | —Unverified | 0 |