SOTAVerified

Spatial Reasoning

Papers

Showing 401–450 of 453 papers

Title | Status | Hype
Mem2Ego: Empowering Vision-Language Models with Global-to-Ego Memory for Long-Horizon Embodied Navigation |  | 0
MindJourney: Test-Time Scaling with World Models for Spatial Reasoning |  | 0
MLLM-For3D: Adapting Multimodal Large Language Model for 3D Reasoning Segmentation |  | 0
MMMR: Benchmarking Massive Multi-Modal Reasoning Tasks |  | 0
MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence |  | 0
Morpho-logic from a Topos Perspective: Application to symbolic AI |  | 0
Multi-camera Bird's Eye View Perception for Autonomous Driving |  | 0
Non-Monotonic Spatial Reasoning with Answer Set Programming Modulo Theories |  | 0
NuScenes-SpatialQA: A Spatial Understanding and Reasoning Benchmark for Vision-Language Models in Autonomous Driving |  | 0
Object Goal Navigation with Recursive Implicit Maps |  | 0
OmniGeo: Towards a Multimodal Large Language Models for Geospatial Artificial Intelligence |  | 0
OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models |  | 0
On Redundant Topological Constraints |  | 0
On the Internal Topological Structure of Plane Regions |  | 0
OpenD: A Benchmark for Language-Driven Door and Drawer Opening |  | 0
OpenSU3D: Open World 3D Scene Understanding using Foundation Models |  | 0
Optimising Language Models for Downstream Tasks: A Post-Training Perspective |  | 0
Out of Sight, Not Out of Context? Egocentric Spatial Reasoning in VLMs Across Disjoint Frames |  | 0
Part Localization using Multi-Proposal Consensus for Fine-Grained Categorization |  | 0
Path-of-Thoughts: Extracting and Following Paths for Robust Relational Reasoning with Large Language Models |  | 0
PeRL: Permutation-Enhanced Reinforcement Learning for Interleaved Vision-Language Reasoning |  | 0
Coarse Correspondences Boost Spatial-Temporal Reasoning in Multimodal Language Model |  | 0
PhyBlock: A Progressive Benchmark for Physical Understanding and Planning via 3D Block Assembly |  | 0
PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs |  | 0
Pix2Scene: Learning Implicit 3D Representations from Images |  | 0
Poly2Vec: Polymorphic Fourier-Based Encoding of Geospatial Objects for GeoAI Applications |  | 0
Preliminary Explorations with GPT-4o(mni) Native Image Generation |  | 0
Proceedings of the 2nd Symposium on Problem-solving, Creativity and Spatial Reasoning in Cognitive Systems, ProSocrates 2017 |  | 0
PRS-Med: Position Reasoning Segmentation with Vision-Language Model in Medical Imaging |  | 0
Quantifying Geospatial in the Common Crawl Corpus |  | 0
R2C: Mapping Room to Chessboard to Unlock LLM As Low-Level Action Planner |  | 0
Reasoning Paths with Reference Objects Elicit Quantitative Spatial Reasoning in Large Vision-Language Models |  | 0
ReCLIP: A Strong Zero-Shot Baseline for Referring Expression Comprehension |  | 0
ReGUIDE: Data Efficient GUI Grounding via Spatial Reasoning and Search |  | 0
Representation, Learning and Reasoning on Spatial Language for Downstream NLP Tasks |  | 0
ReSpace: Text-Driven 3D Scene Synthesis and Editing with Preference Alignment |  | 0
Re-Thinking Inverse Graphics With Large Language Models |  | 0
RLS3: RL-Based Synthetic Sample Selection to Enhance Spatial Reasoning in Vision-Language Models for Indoor Autonomous Perception |  | 0
RoboHop: Segment-based Topological Map Representation for Open-World Visual Navigation |  | 0
RoboPoint: A Vision-Language Model for Spatial Affordance Prediction for Robotics |  | 0
RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics |  | 0
RoboSpatial: Teaching Spatial Understanding to 2D and 3D Vision-Language Models for Robotics |  | 0
ROCKET-2: Steering Visuomotor Policy via Cross-View Goal Alignment |  | 0
RSRWKV: A Linear-Complexity 2D Attention Mechanism for Efficient Remote Sensing Vision Task |  | 0
SAVVY: Spatial Awareness via Audio-Visual LLMs through Seeing and Hearing |  | 0
Scaling RL to Long Videos |  | 0
SceneGPT: A Language Model for 3D Scene Understanding |  | 0
SpatialPIN: Enhancing Spatial Reasoning Capabilities of Vision-Language Models through Prompting and Interacting 3D Priors |  | 0
SEM: Enhancing Spatial Understanding for Robust Robot Manipulation |  | 0
ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models |  | 0

No leaderboard results yet.