SOTAVerified

Spatial Reasoning

Papers

Showing 151–200 of 453 papers

Title | Status | Hype
ImplicitQA: Going beyond frames towards Implicit Video Reasoning | Code | 0
CoSpace: Benchmarking Continuous Space Perception Ability for Vision-Language Models | Code | 0
SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models | Code | 0
SPhyR: Spatial-Physical Reasoning Benchmark on Material Distribution | Code | 0
SpaRC and SpaRP: Spatial Reasoning Characterization and Path Generation for Understanding Spatial Reasoning Capability of Large Language Models | Code | 0
SpaceNLI: Evaluating the Consistency of Predicting Inferences in Space | Code | 0
Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation | Code | 0
SORNet: Spatial Object-Centric Representations for Sequential Manipulation | Code | 0
Guided Navigation from Multiple Viewpoints using Qualitative Spatial Reasoning | Code | 0
Show, Don't Tell: Evaluating Large Language Models Beyond Textual Understanding with ChildPlay | Code | 0
Grounding Spatial Relations in Text-Only Language Models | Code | 0
Grounding Natural Language Instructions: Can Large Language Models Capture Spatial Information? | Code | 0
Grounded Reinforcement Learning for Visual Reasoning | Code | 0
Grid-augmented vision: A simple yet effective approach for enhanced spatial understanding in multi-modal agents | Code | 0
Representation Learning for Grounded Spatial Reasoning | Code | 0
Good at captioning, bad at counting: Benchmarking GPT-4V on Earth observation data | Code | 0
Reframing Spatial Reasoning Evaluation in Language Models: A Real-World Simulation Benchmark for Qualitative Reasoning | Code | 0
Scaling RL to Long Videos | Code | 0
Polymath: A Challenging Multi-modal Mathematical Reasoning Benchmark | Code | 0
OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding | Code | 0
From Text to Space: Mapping Abstract Spatial Models in LLMs during a Grid-World Navigation Task | Code | 0
Neural Task Synthesis for Visual Programming | Code | 0
cilantro: A Lean, Versatile, and Efficient Library for Point Cloud Data Processing | Code | 0
Narrowing the Gap between Vision and Action in Navigation | Code | 0
Neuro-symbolic Training for Reasoning over Spatial Language | Code | 0
MM-GEN: Enhancing Task Performance Through Targeted Multimodal Data Curation | Code | 0
No Blind Spots: Full-Surround Multi-Object Tracking for Autonomous Vehicles using Cameras & LiDARs | Code | 0
FoREST: Frame of Reference Evaluation in Spatial Reasoning Tasks | Code | 0
LOViS: Learning Orientation and Visual Signals for Vision and Language Navigation | Code | 0
Location-Aware Self-Supervised Transformers for Semantic Segmentation | Code | 0
Location Aware Modular Biencoder for Tourism Question Answering | Code | 0
MapEval: A Map-Based Evaluation of Geo-Spatial Reasoning in Foundation Models | Code | 0
FloorNet: A Unified Framework for Floorplan Reconstruction from 3D Scans | Code | 0
Are LLMs the Master of All Trades? : Exploring Domain-Agnostic Reasoning Skills of LLMs | Code | 0
Can Vision Language Models Learn from Visual Demonstrations of Ambiguous Spatial Reasoning? | Code | 0
Knowing Earlier what Right Means to You: A Comprehensive VQA Dataset for Grounding Relative Directions via Multi-Task Learning | Code | 0
Explicit Object Relation Alignment for Vision and Language Navigation | Code | 0
Expand VSR Benchmark for VLLM to Expertize in Spatial Rules | Code | 0
Evaluation of Code LLMs on Geospatial Code Generation | Code | 0
SPaRC: A Spatial Pathfinding Reasoning Challenge | Code | 0
Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors | Code | 0
Can Large Language Models Reason about the Region Connection Calculus? | Code | 0
In-the-wild Audio Spatialization with Flexible Text-guided Localization | Code | 0
Enhancing Logical Reasoning in Large Language Models through Graph-based Synthetic Data | Code | 0
APT: Architectural Planning and Text-to-Blueprint Construction Using Large Language Models for Open-World Agents | Code | 0
Encoding Spatial Relations from Natural Language | Code | 0
Enabling Systematic Generalization in Abstract Spatial Reasoning through Meta-Learning for Compositionality | Code | 0
Inherent limitations of LLMs regarding spatial information | Code | 0
Investigating Relational State Abstraction in Collaborative MARL | Code | 0
Learning to Localize Objects Improves Spatial Reasoning in Visual-LLMs | Code | 0

No leaderboard results yet.