SOTAVerified

Spatial Reasoning

Papers

Showing 351-375 of 453 papers

| Title | Status | Hype |
| FlowVQA: Mapping Multimodal Logic in Visual Question Answering with Flowcharts | | 0 |
| FollowEval: A Multi-Dimensional Benchmark for Assessing the Instruction-Following Capability of Large Language Models | | 0 |
| Following Instructions by Imagining and Reaching Visual Goals | | 0 |
| Foundation Models for Remote Sensing: An Analysis of MLLMs for Object Localization | | 0 |
| FreeInsert: Disentangled Text-Guided Object Insertion in 3D Gaussian Scene without Spatial Priors | | 0 |
| From 2D to 3D Cognition: A Brief Survey of General World Models | | 0 |
| From Objects to Anywhere: A Holistic Benchmark for Multi-level Visual Grounding in 3D Scenes | | 0 |
| From Patches to Objects: Exploiting Spatial Reasoning for Better Visual Representations | | 0 |
| From Spatial Relations to Spatial Configurations | | 0 |
| From Templates to Natural Language: Generalization Challenges in Instruction-Tuned LLMs for Spatial Reasoning | | 0 |
| Generating Human Motion in 3D Scenes from Text Descriptions | | 0 |
| Geo-LLaVA: A Large Multi-Modal Model for Solving Geometry Math Problems with Meta In-Context Learning | | 0 |
| Geometric Feature Enhanced Knowledge Graph Embedding and Spatial Reasoning | | 0 |
| Geometry of 3D Environments and Sum of Squares Polynomials | | 0 |
| Global Information Guided Video Anomaly Detection | | 0 |
| GPT-4o System Card | | 0 |
| Graph Relation Transformer: Incorporating pairwise object features into the Transformer architecture | | 0 |
| GRASP: A Grid-Based Benchmark for Evaluating Commonsense Spatial Reasoning | | 0 |
| Grounded Reinforcement Learning for Visual Reasoning | | 0 |
| GSR-BENCH: A Benchmark for Grounded Spatial Reasoning Evaluation via Multimodal LLMs | | 0 |
| HAMMR: HierArchical MultiModal React agents for generic VQA | | 0 |
| Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation | | 0 |
| History-Aware Question Answering in a Blocks World Dialogue System | | 0 |
| How to Enable LLM with 3D Capacity? A Survey of Spatial Reasoning in LLM | | 0 |
| Hyperdimensional Computing with Spiking-Phasor Neurons | | 0 |
Page 15 of 19

No leaderboard results yet.