SOTAVerified

Spatial Reasoning

Papers

Showing 41-50 of 453 papers

| Title | Status | Hype |
|---|---|---|
| ChatVLA-2: Vision-Language-Action Model with Open-World Embodied Reasoning from Pretrained Knowledge | Code | 1 |
| VLM Can Be a Good Assistant: Enhancing Embodied Visual Tracking with Self-Improving Vision-Language Models | | 0 |
| Jigsaw-Puzzles: From Seeing to Understanding to Reasoning in Vision-Language Models | | 0 |
| MineAnyBuild: Benchmarking Spatial Planning for Open-world AI Agents | Code | 1 |
| VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction | Code | 3 |
| MEBench: A Novel Benchmark for Understanding Mutual Exclusivity Bias in Vision-Language Models | | 0 |
| Agentic 3D Scene Generation with Spatially Contextualized VLMs | | 0 |
| ViTaPEs: Visuotactile Position Encodings for Cross-Modal Alignment in Multimodal Transformers | | 0 |
| Can MLLMs Guide Me Home? A Benchmark Study on Fine-Grained Visual Reasoning from Transit Maps | | 0 |
| U2-BENCH: Benchmarking Large Vision-Language Models on Ultrasound Understanding | | 0 |

No leaderboard results yet.