| FlowVQA: Mapping Multimodal Logic in Visual Question Answering with Flowcharts | Jun 27, 2024 | Decision MakingLogical Reasoning | —Unverified | 0 | 0 |
| FollowEval: A Multi-Dimensional Benchmark for Assessing the Instruction-Following Capability of Large Language Models | Nov 16, 2023 | Instruction FollowingLogical Reasoning | —Unverified | 0 | 0 |
| Following Instructions by Imagining and Reaching Visual Goals | Jan 25, 2020 | Instruction FollowingReinforcement Learning | —Unverified | 0 | 0 |
| Foundation Models for Remote Sensing: An Analysis of MLLMs for Object Localization | Apr 14, 2025 | BenchmarkingEarth Observation | —Unverified | 0 | 0 |
| FreeInsert: Disentangled Text-Guided Object Insertion in 3D Gaussian Scene without Spatial Priors | May 2, 2025 | ObjectSpatial Reasoning | —Unverified | 0 | 0 |
| From 2D to 3D Cognition: A Brief Survey of General World Models | Jun 25, 2025 | Autonomous DrivingScene Generation | —Unverified | 0 | 0 |
| From Objects to Anywhere: A Holistic Benchmark for Multi-level Visual Grounding in 3D Scenes | Jun 5, 2025 | 3D visual groundingObject | —Unverified | 0 | 0 |
| From Patches to Objects: Exploiting Spatial Reasoning for Better Visual Representations | May 21, 2023 | Contrastive LearningLinear evaluation | —Unverified | 0 | 0 |
| From Spatial Relations to Spatial Configurations | Jul 19, 2020 | Abstract Meaning RepresentationNatural Language Understanding | —Unverified | 0 | 0 |
| From Templates to Natural Language: Generalization Challenges in Instruction-Tuned LLMs for Spatial Reasoning | May 20, 2025 | Spatial Reasoning | —Unverified | 0 | 0 |
| Generating Human Motion in 3D Scenes from Text Descriptions | May 13, 2024 | Motion GenerationObject | —Unverified | 0 | 0 |
| Geo-LLaVA: A Large Multi-Modal Model for Solving Geometry Math Problems with Meta In-Context Learning | Dec 12, 2024 | Geometry Problem SolvingIn-Context Learning | —Unverified | 0 | 0 |
| Geometric Feature Enhanced Knowledge Graph Embedding and Spatial Reasoning | Oct 24, 2024 | Graph EmbeddingKnowledge Graph Embedding | —Unverified | 0 | 0 |
| Geometry of 3D Environments and Sum of Squares Polynomials | Nov 22, 2016 | Spatial Reasoning | —Unverified | 0 | 0 |
| Global Information Guided Video Anomaly Detection | Apr 14, 2021 | Anomaly DetectionSpatial Reasoning | —Unverified | 0 | 0 |
| GPT-4o System Card | Oct 25, 2024 | Multiple-choiceSpatial Reasoning | —Unverified | 0 | 0 |
| Graph Relation Transformer: Incorporating pairwise object features into the Transformer architecture | Nov 11, 2021 | Graph AttentionQuestion Answering | —Unverified | 0 | 0 |
| GRASP: A Grid-Based Benchmark for Evaluating Commonsense Spatial Reasoning | Jul 2, 2024 | Spatial Reasoning | —Unverified | 0 | 0 |
| Grounded Reinforcement Learning for Visual Reasoning | May 29, 2025 | reinforcement-learningReinforcement Learning | —Unverified | 0 | 0 |
| GSR-BENCH: A Benchmark for Grounded Spatial Reasoning Evaluation via Multimodal LLMs | Jun 19, 2024 | Spatial ReasoningVisual Reasoning | —Unverified | 0 | 0 |
| HAMMR: HierArchical MultiModal React agents for generic VQA | Apr 8, 2024 | Optical Character Recognition (OCR)Question Answering | —Unverified | 0 | 0 |
| Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation | Dec 7, 2023 | Spatial ReasoningText-to-Video Generation | —Unverified | 0 | 0 |
| History-Aware Question Answering in a Blocks World Dialogue System | May 26, 2020 | Natural Language UnderstandingQuestion Answering | —Unverified | 0 | 0 |
| How to Enable LLM with 3D Capacity? A Survey of Spatial Reasoning in LLM | Apr 8, 2025 | Autonomous VehiclesSpatial Reasoning | —Unverified | 0 | 0 |
| Hyperdimensional Computing with Spiking-Phasor Neurons | Feb 28, 2023 | Spatial Reasoning | —Unverified | 0 | 0 |
| I Know About "Up"! Enhancing Spatial Reasoning in Visual Language Models Through 3D Reconstruction | Jul 19, 2024 | 3D ReconstructionSpatial Reasoning | —Unverified | 0 | 0 |
| ImmerseGen: Agent-Guided Immersive World Generation with Alpha-Textured Proxies | Jun 17, 2025 | Scene GenerationSpatial Reasoning | —Unverified | 0 | 0 |
| Improved Algorithms for Allen's Interval Algebra by Dynamic Programming with Sublinear Partitioning | May 25, 2023 | Spatial Reasoning | —Unverified | 0 | 0 |
| Incentivizing Multimodal Reasoning in Large Models for Direct Robot Manipulation | May 19, 2025 | Multimodal ReasoningRobot Manipulation | —Unverified | 0 | 0 |
| Integrating Symbolic Reasoning into Neural Generative Models for Design Generation | Oct 13, 2023 | Spatial Reasoning | —Unverified | 0 | 0 |
| Intelligence of Things: A Spatial Context-Aware Control System for Smart Devices | Apr 16, 2025 | Spatial Reasoning | —Unverified | 0 | 0 |
| Jigsaw-Puzzles: From Seeing to Understanding to Reasoning in Vision-Language Models | May 27, 2025 | DiagnosticSpatial Reasoning | —Unverified | 0 | 0 |
| JSTR: Joint Spatio-Temporal Reasoning for Event-based Moving Object Detection | Mar 12, 2024 | Motion CompensationMoving Object Detection | —Unverified | 0 | 0 |
| LABNet: Local Graph Aggregation Network with Class Balanced Loss for Vehicle Re-Identification | Nov 29, 2020 | Spatial ReasoningVehicle Re-Identification | —Unverified | 0 | 0 |
| LanguageRefer: Spatial-Language Model for 3D Visual Grounding | Jul 7, 2021 | 3D visual groundingLanguage Modeling | —Unverified | 0 | 0 |
| Large Language-Geometry Model: When LLM meets Equivariance | Feb 16, 2025 | modelSpatial Reasoning | —Unverified | 0 | 0 |
| Large Language Models and Mathematical Reasoning Failures | Feb 17, 2025 | Mathematical ReasoningPhysical Intuition | —Unverified | 0 | 0 |
| Learning event representation: As sparse as possible, but not sparser | Oct 2, 2017 | ClassificationGeneral Classification | —Unverified | 0 | 0 |
| Learning to encode spatial relations from natural language | May 1, 2019 | Spatial Reasoning | —Unverified | 0 | 0 |
| LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning? | Mar 25, 2025 | Autonomous NavigationQuestion Answering | —Unverified | 0 | 0 |
| LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding | Dec 21, 2023 | Instruction FollowingLanguage Modeling | —Unverified | 0 | 0 |
| Location-Aware Self-Supervised Transformers for Semantic Segmentation | Dec 5, 2022 | Contrastive Learningimage-classification | —Unverified | 0 | 0 |
| Long Range Arena : A Benchmark for Efficient Transformers | Jan 1, 2021 | 16kBenchmarking | —Unverified | 0 | 0 |
| LVLM_CSP: Accelerating Large Vision Language Models via Clustering, Scattering, and Pruning for Reasoning Segmentation | Apr 15, 2025 | Image CaptioningQuestion Answering | —Unverified | 0 | 0 |
| M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning | Jul 11, 2025 | Spatial Reasoning | —Unverified | 0 | 0 |
| Manhattan Junction Catalogue for Spatial Reasoning of Indoor Scenes | Jun 1, 2013 | Junction DetectionSpatial Reasoning | —Unverified | 0 | 0 |
| Map Learning with Indistinguishable Locations | Mar 27, 2013 | Spatial Reasoning | —Unverified | 0 | 0 |
| Mathematical Definition and Systematization of Puzzle Rules | Dec 18, 2024 | Game DesignSpatial Reasoning | —Unverified | 0 | 0 |
| MEBench: A Novel Benchmark for Understanding Mutual Exclusivity Bias in Vision-Language Models | May 26, 2025 | Spatial Reasoning | —Unverified | 0 | 0 |
| MEgoHand: Multimodal Egocentric Hand-Object Interaction Motion Generation | May 22, 2025 | Motion GenerationObject | —Unverified | 0 | 0 |