SOTAVerified

Spatial Reasoning

Papers

Showing 51-60 of 453 papers

Title | Status | Hype
Act3D: 3D Feature Field Transformers for Multi-Task Robotic Manipulation | Code | 2
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models | Code | 2
ConceptFusion: Open-set Multimodal 3D Mapping | Code | 2
Warehouse Spatial Question Answering with LLM Agent | Code | 1
3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation | Code | 1
Unfolding Spatial Cognition: Evaluating Multimodal Models on Visual Simulations | Code | 1
VideoCAD: A Large-Scale Video Dataset for Learning UI Interactions and 3D Reasoning from CAD Software | Code | 1
Seeing is Not Reasoning: MVPBench for Graph-based Evaluation of Multi-path Visual Physical CoT | Code | 1
Uni-MuMER: Unified Multi-Task Fine-Tuning of Vision-Language Model for Handwritten Mathematical Expression Recognition | Code | 1
ChatVLA-2: Vision-Language-Action Model with Open-World Embodied Reasoning from Pretrained Knowledge | Code | 1

No leaderboard results yet.