SOTAVerified

Spatial Reasoning

Papers

Showing 51-75 of 453 papers

| Title | Status | Hype |
| --- | --- | --- |
| Towards Dynamic 3D Reconstruction of Hand-Instrument Interaction in Ophthalmic Surgery | | 0 |
| Knot So Simple: A Minimalistic Environment for Spatial Reasoning | Code | 1 |
| DetailMaster: Can Your Text-to-Image Model Handle Long Prompts? | Code | 0 |
| Bridging the Dynamic Perception Gap: Training-Free Draft Chain-of-Thought for Dynamic Multimodal Spatial Reasoning | Code | 0 |
| MEgoHand: Multimodal Egocentric Hand-Object Interaction Motion Generation | | 0 |
| SPaRC: A Spatial Pathfinding Reasoning Challenge | Code | 0 |
| VLM-R^3: Region Recognition, Reasoning, and Refinement for Enhanced Multimodal Chain-of-Thought | | 0 |
| MMMR: Benchmarking Massive Multi-Modal Reasoning Tasks | | 0 |
| SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding | Code | 2 |
| CoNav: Collaborative Cross-Modal Reasoning for Embodied Navigation | Code | 1 |
| SEM: Enhancing Spatial Understanding for Robust Robot Manipulation | | 0 |
| GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning | Code | 2 |
| SPhyR: Spatial-Physical Reasoning Benchmark on Material Distribution | Code | 0 |
| STAR-R1: Spatial TrAnsformation Reasoning by Reinforcing Multimodal LLMs | Code | 0 |
| ReGUIDE: Data Efficient GUI Grounding via Spatial Reasoning and Search | | 0 |
| From Templates to Natural Language: Generalization Challenges in Instruction-Tuned LLMs for Spatial Reasoning | | 0 |
| Towards Embodied Cognition in Robots via Spatially Grounded Synthetic Worlds | | 0 |
| Incentivizing Multimodal Reasoning in Large Models for Direct Robot Manipulation | | 0 |
| Visuospatial Cognitive Assistant | Code | 1 |
| SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning | | 0 |
| Can Large Multimodal Models Understand Agricultural Scenes? Benchmarking with AgroMind | | 0 |
| Towards Visuospatial Cognition via Hierarchical Fusion of Visual Experts | Code | 1 |
| Are Multimodal Large Language Models Ready for Omnidirectional Spatial Reasoning? | | 0 |
| PRS-Med: Position Reasoning Segmentation with Vision-Language Model in Medical Imaging | | 0 |
| A Light and Smart Wearable Platform with Multimodal Foundation Model for Enhanced Spatial Reasoning in People with Blindness and Low Vision | | 0 |

No leaderboard results yet.