| Grounded Reinforcement Learning for Visual Reasoning | May 29, 2025 | reinforcement-learning, Reinforcement Learning | —Unverified | 0 |
| GSR-BENCH: A Benchmark for Grounded Spatial Reasoning Evaluation via Multimodal LLMs | Jun 19, 2024 | Spatial Reasoning, Visual Reasoning | —Unverified | 0 |
| HAMMR: HierArchical MultiModal React agents for generic VQA | Apr 8, 2024 | Optical Character Recognition (OCR), Question Answering | —Unverified | 0 |
| Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation | Dec 7, 2023 | Spatial Reasoning, Text-to-Video Generation | —Unverified | 0 |
| History-Aware Question Answering in a Blocks World Dialogue System | May 26, 2020 | Natural Language Understanding, Question Answering | —Unverified | 0 |
| How to Enable LLM with 3D Capacity? A Survey of Spatial Reasoning in LLM | Apr 8, 2025 | Autonomous Vehicles, Spatial Reasoning | —Unverified | 0 |
| Hyperdimensional Computing with Spiking-Phasor Neurons | Feb 28, 2023 | Spatial Reasoning | —Unverified | 0 |
| I Know About "Up"! Enhancing Spatial Reasoning in Visual Language Models Through 3D Reconstruction | Jul 19, 2024 | 3D Reconstruction, Spatial Reasoning | —Unverified | 0 |
| ImmerseGen: Agent-Guided Immersive World Generation with Alpha-Textured Proxies | Jun 17, 2025 | Scene Generation, Spatial Reasoning | —Unverified | 0 |
| Improved Algorithms for Allen's Interval Algebra by Dynamic Programming with Sublinear Partitioning | May 25, 2023 | Spatial Reasoning | —Unverified | 0 |
| Incentivizing Multimodal Reasoning in Large Models for Direct Robot Manipulation | May 19, 2025 | Multimodal Reasoning, Robot Manipulation | —Unverified | 0 |
| Integrating Symbolic Reasoning into Neural Generative Models for Design Generation | Oct 13, 2023 | Spatial Reasoning | —Unverified | 0 |
| Intelligence of Things: A Spatial Context-Aware Control System for Smart Devices | Apr 16, 2025 | Spatial Reasoning | —Unverified | 0 |
| Jigsaw-Puzzles: From Seeing to Understanding to Reasoning in Vision-Language Models | May 27, 2025 | Diagnostic, Spatial Reasoning | —Unverified | 0 |
| JSTR: Joint Spatio-Temporal Reasoning for Event-based Moving Object Detection | Mar 12, 2024 | Motion Compensation, Moving Object Detection | —Unverified | 0 |
| LABNet: Local Graph Aggregation Network with Class Balanced Loss for Vehicle Re-Identification | Nov 29, 2020 | Spatial Reasoning, Vehicle Re-Identification | —Unverified | 0 |
| LanguageRefer: Spatial-Language Model for 3D Visual Grounding | Jul 7, 2021 | 3D visual grounding, Language Modeling | —Unverified | 0 |
| Large Language-Geometry Model: When LLM meets Equivariance | Feb 16, 2025 | model, Spatial Reasoning | —Unverified | 0 |
| Large Language Models and Mathematical Reasoning Failures | Feb 17, 2025 | Mathematical Reasoning, Physical Intuition | —Unverified | 0 |
| Learning event representation: As sparse as possible, but not sparser | Oct 2, 2017 | Classification, General Classification | —Unverified | 0 |
| Learning to encode spatial relations from natural language | May 1, 2019 | Spatial Reasoning | —Unverified | 0 |
| LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning? | Mar 25, 2025 | Autonomous Navigation, Question Answering | —Unverified | 0 |
| LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding | Dec 21, 2023 | Instruction Following, Language Modeling | —Unverified | 0 |
| Location-Aware Self-Supervised Transformers for Semantic Segmentation | Dec 5, 2022 | Contrastive Learning, image-classification | —Unverified | 0 |
| SlotGNN: Unsupervised Discovery of Multi-Object Representations and Visual Dynamics | Oct 6, 2023 | Object, Object Discovery | —Unverified | 0 |