SOTAVerified

Spatial Reasoning

Papers

Showing 176–200 of 453 papers

| Title | Status | Hype |
| --- | --- | --- |
| MEgoHand: Multimodal Egocentric Hand-Object Interaction Motion Generation | | 0 |
| VLM-R^3: Region Recognition, Reasoning, and Refinement for Enhanced Multimodal Chain-of-Thought | | 0 |
| DetailMaster: Can Your Text-to-Image Model Handle Long Prompts? | | 0 |
| SEM: Enhancing Spatial Understanding for Robust Robot Manipulation | | 0 |
| SPaRC: A Spatial Pathfinding Reasoning Challenge | Code | 0 |
| MMMR: Benchmarking Massive Multi-Modal Reasoning Tasks | | 0 |
| STAR-R1: Spacial TrAnsformation Reasoning by Reinforcing Multimodal LLMs | Code | 0 |
| ReGUIDE: Data Efficient GUI Grounding via Spatial Reasoning and Search | | 0 |
| SPhyR: Spatial-Physical Reasoning Benchmark on Material Distribution | Code | 0 |
| Towards Embodied Cognition in Robots via Spatially Grounded Synthetic Worlds | | 0 |
| From Templates to Natural Language: Generalization Challenges in Instruction-Tuned LLMs for Spatial Reasoning | | 0 |
| Incentivizing Multimodal Reasoning in Large Models for Direct Robot Manipulation | | 0 |
| SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning | | 0 |
| Can Large Multimodal Models Understand Agricultural Scenes? Benchmarking with AgroMind | | 0 |
| Are Multimodal Large Language Models Ready for Omnidirectional Spatial Reasoning? | | 0 |
| PRS-Med: Position Reasoning Segmentation with Vision-Language Model in Medical Imaging | | 0 |
| A Light and Smart Wearable Platform with Multimodal Foundation Model for Enhanced Spatial Reasoning in People with Blindness and Low Vision | | 0 |
| SpatialPrompting: Keyframe-driven Zero-Shot Spatial Reasoning with Off-the-Shelf Multimodal Large Language Models | | 0 |
| SITE: towards Spatial Intelligence Thorough Evaluation | | 0 |
| Preliminary Explorations with GPT-4o(mni) Native Image Generation | | 0 |
| Beyond Recognition: Evaluating Visual Perspective Taking in Vision Language Models | | 0 |
| FreeInsert: Disentangled Text-Guided Object Insertion in 3D Gaussian Scene without Spatial Priors | | 0 |
| SpatialLLM: A Compound 3D-Informed Design towards Spatially-Intelligent Large Multimodal Models | | 0 |
| First Order Logic with Fuzzy Semantics for Describing and Recognizing Nerves in Medical Images | | 0 |
| SpatialReasoner: Towards Explicit and Generalizable 3D Spatial Reasoning | | 0 |
Page 8 of 19

No leaderboard results yet.