| An Empirical Study of Conformal Prediction in LLM with ASP Scaffolds for Robust Reasoning | Mar 7, 2025 | Conformal PredictionLanguage Modelling | —Unverified | 0 |
| A Neural Representation Framework with LLM-Driven Spatial Reasoning for Open-Vocabulary 3D Visual Grounding | Jul 9, 2025 | 3D visual groundingAutonomous Navigation | —Unverified | 0 |
| An Evaluation of ChatGPT-4's Qualitative Spatial Reasoning Capabilities in RCC-8 | Sep 27, 2023 | Spatial Reasoning | —Unverified | 0 |
| A Pilot Evaluation of ChatGPT and DALL-E 2 on Decision Making and Spatial Reasoning | Feb 15, 2023 | Decision MakingSpatial Reasoning | —Unverified | 0 |
| Dspy-based Neural-Symbolic Pipeline to Enhance Spatial Reasoning in LLMs | Nov 27, 2024 | Logical ReasoningSemantic Parsing | —Unverified | 0 |
| Architect: Generating Vivid and Interactive 3D Scenes with Hierarchical 2D Inpainting | Nov 14, 2024 | Depth EstimationImage Inpainting | —Unverified | 0 |
| Are Multimodal Large Language Models Ready for Omnidirectional Spatial Reasoning? | May 17, 2025 | HallucinationObject Counting | —Unverified | 0 |
| A Review of 3D Object Detection with Vision-Language Models | Apr 25, 2025 | 3D Object DetectionObject | —Unverified | 0 |
| A Schema-Guided Reason-while-Retrieve framework for Reasoning on Scene Graphs with Large-Language-Models (LLMs) | Feb 5, 2025 | HallucinationSpatial Reasoning | —Unverified | 0 |
| A Self-Supervised Auxiliary Loss for Deep RL in Partially Observable Settings | Apr 17, 2021 | NavigateSpatial Reasoning | —Unverified | 0 |
| A Solver-Aided Hierarchical Language for LLM-Driven CAD Design | Feb 13, 2025 | Spatial Reasoning | —Unverified | 0 |
| ASPMT(QS): Non-Monotonic Spatial Reasoning with Answer Set Programming Modulo Theories | Jun 16, 2015 | Spatial Reasoning | —Unverified | 0 |
| A Spoken Dialogue System for Spatial Question Answering in a Physical Blocks World | Nov 6, 2019 | Natural Language UnderstandingQuestion Answering | —Unverified | 0 |
| A Surprising Failure? Multimodal LLMs and the NLVR Challenge | Feb 26, 2024 | SentenceSpatial Reasoning | —Unverified | 0 |
| A Survey for Foundation Models in Autonomous Driving | Feb 2, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| A Survey of Large Language Model-Powered Spatial Intelligence Across Scales: Advances in Embodied Agents, Smart Cities, and Earth Science | Apr 14, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Symbolic Representation of Human Posture for Interpretable Learning and Reasoning | Oct 17, 2022 | Activity RecognitionSpatial Reasoning | —Unverified | 0 |
| Atari-GPT: Benchmarking Multimodal Large Language Models as Low-Level Policies in Atari Games | Aug 28, 2024 | Atari GamesBenchmarking | —Unverified | 0 |
| AuxDepthNet: Real-Time Monocular 3D Object Detection with Depth-Sensitive Features | Jan 7, 2025 | 3D Object DetectionComputational Efficiency | —Unverified | 0 |
| A Vision Centric Remote Sensing Benchmark | Mar 20, 2025 | Question AnsweringRepresentation Learning | —Unverified | 0 |
| BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games | Nov 20, 2024 | BenchmarkingNetHack | —Unverified | 0 |
| Beyond Human Vision: The Role of Large Vision Language Models in Microscope Image Analysis | May 1, 2024 | Image CaptioningQuestion Answering | —Unverified | 0 |
| Beyond Recognition: Evaluating Visual Perspective Taking in Vision Language Models | May 3, 2025 | DiagnosticObject Recognition | —Unverified | 0 |
| Beyond Semantics: Rediscovering Spatial Awareness in Vision-Language Models | Mar 21, 2025 | DiagnosticObject Recognition | —Unverified | 0 |
| Beyond the Hype: A dispassionate look at vision-language models in medical scenario | Aug 16, 2024 | Question AnsweringSpatial Reasoning | —Unverified | 0 |
| Boosting Diffusion-Based Text Image Super-Resolution Model Towards Generalized Real-World Scenarios | Mar 10, 2025 | Image RestorationImage Super-Resolution | —Unverified | 0 |
| Bridging Visualization and Optimization: Multimodal Large Language Models on Graph-Structured Combinatorial Optimization | Jan 21, 2025 | Combinatorial OptimizationSequential Decision Making | —Unverified | 0 |
| ByDeWay: Boost Your multimodal LLM with DEpth prompting in a Training-Free Way | Jul 11, 2025 | Depth EstimationHallucination | —Unverified | 0 |
| CAD-GPT: Synthesising CAD Construction Sequence with Spatial Reasoning-Enhanced Multimodal LLMs | Dec 27, 2024 | Spatial Reasoning | —Unverified | 0 |
| Can Large Language Models Create New Knowledge for Spatial Reasoning Tasks? | May 23, 2024 | Spatial Reasoning | —Unverified | 0 |
| Can Large Multimodal Models Understand Agricultural Scenes? Benchmarking with AgroMind | May 18, 2025 | BenchmarkingScene Understanding | —Unverified | 0 |
| Can LLM be a Good Path Planner based on Prompt Engineering? Mitigating the Hallucination for Path Planning | Aug 23, 2024 | HallucinationPrompt Engineering | —Unverified | 0 |
| Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps | May 24, 2025 | Scene UnderstandingSpatial Reasoning | —Unverified | 0 |
| CASPER: Cognitive Architecture for Social Perception and Engagement in Robots | Sep 1, 2022 | Action RecognitionNavigate | —Unverified | 0 |
| Chain of Semantics Programming in 3D Gaussian Splatting Representation for 3D Vision Grounding | Jan 1, 2025 | 3DGSLarge Language Model | —Unverified | 0 |
| Challenge of Spatial Cognition for Deep Learning | Jul 30, 2019 | Deep LearningSpatial Reasoning | —Unverified | 0 |
| Challenges Faced by Large Language Models in Solving Multi-Agent Flocking | Apr 6, 2024 | Decision MakingSpatial Reasoning | —Unverified | 0 |
| CleverDistiller: Simple and Spatially Consistent Cross-modal Distillation | Mar 12, 2025 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Cog-GA: A Large Language Models-based Generative Agent for Vision-Language Navigation in Continuous Environments | Sep 4, 2024 | Continual LearningNavigate | —Unverified | 0 |
| Combining Deep Learning and Qualitative Spatial Reasoning to Learn Complex Structures from Sparse Examples with Noise | Nov 27, 2018 | AI AgentHeuristic Search | —Unverified | 0 |
| Commonsense Spatial Reasoning for Visually Intelligent Agents | Apr 1, 2021 | Spatial Reasoning | —Unverified | 0 |
| Commonsense Visual Sensemaking for Autonomous Driving: On Generalised Neurosymbolic Online Abduction Integrating Vision and Semantics | Dec 28, 2020 | Autonomous DrivingQuestion Answering | —Unverified | 0 |
| Complexity Classification in Infinite-Domain Constraint Satisfaction | Jan 4, 2012 | ClassificationGeneral Classification | —Unverified | 0 |
| Contextual Reasoning for Scene Generation (Technical Report) | May 3, 2023 | Scene GenerationSpatial Reasoning | —Unverified | 0 |
| Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training | Mar 4, 2024 | MathPhrase Grounding | —Unverified | 0 |
| Controllable Text-to-Image Generation with GPT-4 | May 29, 2023 | Image GenerationInstruction Following | —Unverified | 0 |
| DARE: Diverse Visual Question Answering with Robustness Evaluation | Sep 26, 2024 | image-classificationImage Classification | —Unverified | 0 |
| DataPlatter: Boosting Robotic Manipulation Generalization with Minimal Costly Data | Mar 25, 2025 | Robot ManipulationSpatial Reasoning | —Unverified | 0 |
| Dialectical language model evaluation: An initial appraisal of the commonsense spatial reasoning abilities of LLMs | Apr 22, 2023 | Language Model EvaluationLanguage Modeling | —Unverified | 0 |
| Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning | Jun 5, 2025 | In-Context LearningIndoor Scene Synthesis | —Unverified | 0 |