SOTAVerified

Spatial Reasoning

Papers

Showing 151175 of 453 papers

TitleStatusHype
Navigating Motion Agents in Dynamic and Cluttered Environments through LLM Reasoning0
Beyond the Hype: A dispassionate look at vision-language models in medical scenario0
DriveVLM: The Convergence of Autonomous Driving and Large Vision-Language Models0
Beyond Semantics: Rediscovering Spatial Awareness in Vision-Language Models0
An Empirical Study of Conformal Prediction in LLM with ASP Scaffolds for Robust Reasoning0
Do Multimodal Language Models Really Understand Direction? A Benchmark for Compass Direction Reasoning0
Beyond Recognition: Evaluating Visual Perspective Taking in Vision Language Models0
DivCon: Divide and Conquer for Progressive Text-to-Image Generation0
Distortions in Judged Spatial Relations in Large Language Models0
Beyond Human Vision: The Role of Large Vision Language Models in Microscope Image Analysis0
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games0
Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning0
A Multi-Modal Spatial Risk Framework for EV Charging Infrastructure Using Remote Sensing0
Dialectical language model evaluation: An initial appraisal of the commonsense spatial reasoning abilities of LLMs0
DetailMaster: Can Your Text-to-Image Model Handle Long Prompts?0
A Vision Centric Remote Sensing Benchmark0
A dual contrastive framework0
Coarse Correspondences Boost Spatial-Temporal Reasoning in Multimodal Language Model0
AuxDepthNet: Real-Time Monocular 3D Object Detection with Depth-Sensitive Features0
AlphaSpace: Enabling Robotic Actions through Semantic Tokenization and Symbolic Reasoning0
DataPlatter: Boosting Robotic Manipulation Generalization with Minimal Costly Data0
DARE: Diverse Visual Question Answering with Robustness Evaluation0
Atari-GPT: Benchmarking Multimodal Large Language Models as Low-Level Policies in Atari Games0
Space-LLaVA: a Vision-Language Model Adapted to Extraterrestrial Applications0
A Light and Smart Wearable Platform with Multimodal Foundation Model for Enhanced Spatial Reasoning in People with Blindness and Low Vision0
Show:102550
← PrevPage 7 of 19Next →

No leaderboard results yet.