SOTAVerified

Spatial Reasoning

Papers

Showing 426–450 of 453 papers

| Title | Status | Hype |
| --- | --- | --- |
| Poly2Vec: Polymorphic Fourier-Based Encoding of Geospatial Objects for GeoAI Applications | | 0 |
| Preliminary Explorations with GPT-4o(mni) Native Image Generation | | 0 |
| Proceedings of the 2nd Symposium on Problem-solving, Creativity and Spatial Reasoning in Cognitive Systems, ProSocrates 2017 | | 0 |
| PRS-Med: Position Reasoning Segmentation with Vision-Language Model in Medical Imaging | | 0 |
| Quantifying Geospatial in the Common Crawl Corpus | | 0 |
| R2C: Mapping Room to Chessboard to Unlock LLM As Low-Level Action Planner | | 0 |
| Reasoning Paths with Reference Objects Elicit Quantitative Spatial Reasoning in Large Vision-Language Models | | 0 |
| ReCLIP: A Strong Zero-Shot Baseline for Referring Expression Comprehension | | 0 |
| ReGUIDE: Data Efficient GUI Grounding via Spatial Reasoning and Search | | 0 |
| Representation, Learning and Reasoning on Spatial Language for Downstream NLP Tasks | | 0 |
| ReSpace: Text-Driven 3D Scene Synthesis and Editing with Preference Alignment | | 0 |
| Re-Thinking Inverse Graphics With Large Language Models | | 0 |
| RLS3: RL-Based Synthetic Sample Selection to Enhance Spatial Reasoning in Vision-Language Models for Indoor Autonomous Perception | | 0 |
| RoboHop: Segment-based Topological Map Representation for Open-World Visual Navigation | | 0 |
| RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for Robotics | | 0 |
| RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics | | 0 |
| RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics | | 0 |
| ROCKET-2: Steering Visuomotor Policy via Cross-View Goal Alignment | | 0 |
| RSRWKV: A Linear-Complexity 2D Attention Mechanism for Efficient Remote Sensing Vision Task | | 0 |
| SAVVY: Spatial Awareness via Audio-Visual LLMs through Seeing and Hearing | | 0 |
| Scaling RL to Long Videos | | 0 |
| SceneGPT: A Language Model for 3D Scene Understanding | | 0 |
| SpatialPIN: Enhancing Spatial Reasoning Capabilities of Vision-Language Models through Prompting and Interacting 3D Priors | | 0 |
| SEM: Enhancing Spatial Understanding for Robust Robot Manipulation | | 0 |
| ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models | | 0 |

No leaderboard results yet.