SOTAVerified

Spatial Reasoning

Papers

Showing 151200 of 453 papers

TitleStatusHype
Navigating Motion Agents in Dynamic and Cluttered Environments through LLM Reasoning0
Beyond the Hype: A dispassionate look at vision-language models in medical scenario0
DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Models0
Beyond Semantics: Rediscovering Spatial Awareness in Vision-Language Models0
An Empirical Study of Conformal Prediction in LLM with ASP Scaffolds for Robust Reasoning0
LVLM_CSP: Accelerating Large Vision Language Models via Clustering, Scattering, and Pruning for Reasoning Segmentation0
M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning0
Do Multimodal Language Models Really Understand Direction? A Benchmark for Compass Direction Reasoning0
Beyond Recognition: Evaluating Visual Perspective Taking in Vision Language Models0
DivCon: Divide and Conquer for Progressive Text-to-Image Generation0
Distortions in Judged Spatial Relations in Large Language Models0
Beyond Human Vision: The Role of Large Vision Language Models in Microscope Image Analysis0
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games0
Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning0
A dual contrastive framework0
Dialectical language model evaluation: An initial appraisal of the commonsense spatial reasoning abilities of LLMs0
A Vision Centric Remote Sensing Benchmark0
A Multi-Modal Spatial Risk Framework for EV Charging Infrastructure Using Remote Sensing0
Long Range Arena : A Benchmark for Efficient Transformers0
Coarse Correspondences Boost Spatial-Temporal Reasoning in Multimodal Language Model0
AuxDepthNet: Real-Time Monocular 3D Object Detection with Depth-Sensitive Features0
AlphaSpace: Enabling Robotic Actions through Semantic Tokenization and Symbolic Reasoning0
DataPlatter: Boosting Robotic Manipulation Generalization with Minimal Costly Data0
DARE: Diverse Visual Question Answering with Robustness Evaluation0
Atari-GPT: Benchmarking Multimodal Large Language Models as Low-Level Policies in Atari Games0
Improved Algorithms for Allen's Interval Algebra by Dynamic Programming with Sublinear Partitioning0
Space-LLaVA: a Vision-Language Model Adapted to Extraterrestrial Applications0
ImmerseGen: Agent-Guided Immersive World Generation with Alpha-Textured Proxies0
A Light and Smart Wearable Platform with Multimodal Foundation Model for Enhanced Spatial Reasoning in People with Blindness and Low Vision0
LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding0
Manhattan Junction Catalogue for Spatial Reasoning of Indoor Scenes0
Controllable Text-to-Image Generation with GPT-40
How to Enable LLM with 3D Capacity? A Survey of Spatial Reasoning in LLM0
A Symbolic Representation of Human Posture for Interpretable Learning and Reasoning0
History-Aware Question Answering in a Blocks World Dialogue System0
Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training0
A Survey of Large Language Model-Powered Spatial Intelligence Across Scales: Advances in Embodied Agents, Smart Cities, and Earth Science0
HAMMR: HierArchical MultiModal React agents for generic VQA0
Hyperdimensional Computing with Spiking-Phasor Neurons0
I Know About "Up"! Enhancing Spatial Reasoning in Visual Language Models Through 3D Reconstruction0
Contextual Reasoning for Scene Generation (Technical Report)0
GSR-BENCH: A Benchmark for Grounded Spatial Reasoning Evaluation via Multimodal LLMs0
A Survey for Foundation Models in Autonomous Driving0
LanguageRefer: Spatial-Language Model for 3D Visual Grounding0
Large Language-Geometry Model: When LLM meets Equivariance0
Complexity Classification in Infinite-Domain Constraint Satisfaction0
Incentivizing Multimodal Reasoning in Large Models for Direct Robot Manipulation0
A Surprising Failure? Multimodal LLMs and the NLVR Challenge0
Commonsense Visual Sensemaking for Autonomous Driving: On Generalised Neurosymbolic Online Abduction Integrating Vision and Semantics0
A LLM Benchmark based on the Minecraft Builder Dialog Agent Task0
Show:102550
← PrevPage 4 of 10Next →

No leaderboard results yet.