SOTAVerified

Spatial Reasoning

Papers

Showing 351400 of 453 papers

TitleStatusHype
GPT-4 Technical ReportCode6
Morpho-logic from a Topos Perspective: Application to symbolic AI0
Hyperdimensional Computing with Spiking-Phasor Neurons0
A Pilot Evaluation of ChatGPT and DALL-E 2 on Decision Making and Spatial Reasoning0
ConceptFusion: Open-set Multimodal 3D MappingCode2
Translating Natural Language to Planning Goals with Large-Language ModelsCode1
Ego-Humans: An Ego-Centric 3D Multi-Human Benchmark0
Are Deep Neural Networks SMARTer than Second Graders?Code1
OpenD: A Benchmark for Language-Driven Door and Drawer Opening0
Location-Aware Self-Supervised Transformers for Semantic Segmentation0
Spatial Reasoning for Few-Shot Object Detection0
A Symbolic Representation of Human Posture for Interpretable Learning and Reasoning0
LOViS: Learning Orientation and Visual Signals for Vision and Language NavigationCode0
Toward 3D Spatial Reasoning for Human-like Text-based Visual Question Answering0
CASPER: Cognitive Architecture for Social Perception and Engagement in Robots0
Knowing Earlier what Right Means to You: A Comprehensive VQA Dataset for Grounding Relative Directions via Multi-Task LearningCode0
Translating Place-Related Questions to GeoSPARQL QueriesCode0
Explicit Object Relation Alignment for Vision and Language NavigationCode0
Visual Spatial ReasoningCode1
StepGame: A New Benchmark for Robust Multi-Hop Spatial Reasoning in TextsCode1
ReCLIP: A Strong Zero-Shot Baseline for Referring Expression ComprehensionCode1
Capturing Shape Information with Multi-Scale Topological Loss Terms for 3D ReconstructionCode1
DeepSSN: a deep convolutional neural network to assess spatial scene similarityCode0
ReCLIP: A Strong Zero-Shot Baseline for Referring Expression Comprehension0
Explicit Object Relation Alignment for Vision and Language Navigation0
Graph Relation Transformer: Incorporating pairwise object features into the Transformer architecture0
Revisiting spatio-temporal layouts for compositional action recognitionCode1
IndoNLI: A Natural Language Inference Dataset for IndonesianCode1
Unsupervised Representation Learning Facilitates Human-like Spatial Reasoning0
CLIPort: What and Where Pathways for Robotic ManipulationCode1
Grounding Natural Language Instructions: Can Large Language Models Capture Spatial Information?Code0
SORNet: Spatial Object-Centric Representations for Sequential ManipulationCode0
Weakly Supervised Relative Spatial Reasoning for Visual Question AnsweringCode0
Teaching Agents how to Map: Spatial Reasoning for Multi-Object NavigationCode1
LanguageRefer: Spatial-Language Model for 3D Visual Grounding0
SPARTQA: A Textual Question Answering Benchmark for Spatial ReasoningCode1
SBEVNet: End-to-End Deep Stereo Layout EstimationCode1
Towards Navigation by Reasoning over Spatial Configurations0
Self-supervised Spatial Reasoning on Multi-View Line DrawingsCode1
A Self-Supervised Auxiliary Loss for Deep RL in Partially Observable Settings0
Global Information Guided Video Anomaly Detection0
SpartQA: : A Textual Question Answering Benchmark for Spatial ReasoningCode1
Commonsense Spatial Reasoning for Visually Intelligent Agents0
Stride and Translation Invariance in CNNs0
End-to-End Egospheric Spatial MemoryCode1
Multi-scale GCN-assisted two-stage network for joint segmentation of retinal layers and disc in peripapillary OCT imagesCode1
Grounding Consistency: Distilling Spatial Common Sense for Precise Visual Relationship DetectionCode1
Long Range Arena : A Benchmark for Efficient Transformers0
Ego-Centric Spatial Memory Networks0
Commonsense Visual Sensemaking for Autonomous Driving: On Generalised Neurosymbolic Online Abduction Integrating Vision and Semantics0
Show:102550
← PrevPage 8 of 10Next →

No leaderboard results yet.