SOTAVerified

Spatial Reasoning

Papers

Showing 351400 of 453 papers

TitleStatusHype
A Survey for Foundation Models in Autonomous Driving0
Good at captioning, bad at counting: Benchmarking GPT-4V on Earth observation dataCode0
SpatialVLM: Endowing Vision-Language Models with Spatial Reasoning Capabilities0
StarCraftImage: A Dataset For Prototyping Spatial Reasoning Methods For Multi-Agent Environments0
Distortions in Judged Spatial Relations in Large Language Models0
Location Aware Modular Biencoder for Tourism Question AnsweringCode0
LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding0
Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation0
Inherent limitations of LLMs regarding spatial informationCode0
Exploring and Improving the Spatial Reasoning Abilities of Large Language Models0
FollowEval: A Multi-Dimensional Benchmark for Assessing the Instruction-Following Capability of Large Language Models0
Disentangling Extraction and Reasoning in Multi-hop Spatial ReasoningCode0
DepWiGNN: A Depth-wise Graph Neural Network for Multi-hop Spatial Reasoning in TextCode0
Evaluating Robustness of Visual Representations for Object Assembly Task Requiring Spatio-Geometrical Reasoning0
Integrating Symbolic Reasoning into Neural Generative Models for Design Generation0
SlotGNN: Unsupervised Discovery of Multi-Object Representations and Visual Dynamics0
An Evaluation of ChatGPT-4's Qualitative Spatial Reasoning Capabilities in RCC-80
Dense 2D-3D Indoor Prediction with Sound via Aligned Cross-Modal DistillationCode0
Multi-camera Bird's Eye View Perception for Autonomous Driving0
STUPD: A Synthetic Dataset for Spatial and Temporal Relation ReasoningCode0
Towards Grounded Visual Spatial Reasoning in Multi-Modal Vision Language Models0
Object Goal Navigation with Recursive Implicit Maps0
Spatial Intelligence of a Self-driving Car and Rule-Based Decision Making0
SpaceNLI: Evaluating the Consistency of Predicting Inferences in SpaceCode0
Controllable Text-to-Image Generation with GPT-40
Neural Task Synthesis for Visual ProgrammingCode0
Improved Algorithms for Allen's Interval Algebra by Dynamic Programming with Sublinear Partitioning0
EgoHumans: An Egocentric 3D Multi-Human BenchmarkCode0
From Patches to Objects: Exploiting Spatial Reasoning for Better Visual Representations0
Contextual Reasoning for Scene Generation (Technical Report)0
Dialectical language model evaluation: An initial appraisal of the commonsense spatial reasoning abilities of LLMs0
Are LLMs the Master of All Trades? : Exploring Domain-Agnostic Reasoning Skills of LLMsCode0
Morpho-logic from a Topos Perspective: Application to symbolic AI0
Hyperdimensional Computing with Spiking-Phasor Neurons0
A Pilot Evaluation of ChatGPT and DALL-E 2 on Decision Making and Spatial Reasoning0
Ego-Humans: An Ego-Centric 3D Multi-Human Benchmark0
OpenD: A Benchmark for Language-Driven Door and Drawer Opening0
Location-Aware Self-Supervised Transformers for Semantic Segmentation0
Spatial Reasoning for Few-Shot Object Detection0
A Symbolic Representation of Human Posture for Interpretable Learning and Reasoning0
LOViS: Learning Orientation and Visual Signals for Vision and Language NavigationCode0
Toward 3D Spatial Reasoning for Human-like Text-based Visual Question Answering0
CASPER: Cognitive Architecture for Social Perception and Engagement in Robots0
Knowing Earlier what Right Means to You: A Comprehensive VQA Dataset for Grounding Relative Directions via Multi-Task LearningCode0
Translating Place-Related Questions to GeoSPARQL QueriesCode0
Explicit Object Relation Alignment for Vision and Language NavigationCode0
DeepSSN: a deep convolutional neural network to assess spatial scene similarityCode0
ReCLIP: A Strong Zero-Shot Baseline for Referring Expression Comprehension0
Explicit Object Relation Alignment for Vision and Language Navigation0
Graph Relation Transformer: Incorporating pairwise object features into the Transformer architecture0
Show:102550
← PrevPage 8 of 10Next →

No leaderboard results yet.