| GPT-4 Technical Report | Mar 15, 2023 | answerability predictionArithmetic Reasoning | CodeCode Available | 6 |
| Morpho-logic from a Topos Perspective: Application to symbolic AI | Mar 8, 2023 | Spatial Reasoning | —Unverified | 0 |
| Hyperdimensional Computing with Spiking-Phasor Neurons | Feb 28, 2023 | Spatial Reasoning | —Unverified | 0 |
| A Pilot Evaluation of ChatGPT and DALL-E 2 on Decision Making and Spatial Reasoning | Feb 15, 2023 | Decision MakingSpatial Reasoning | —Unverified | 0 |
| ConceptFusion: Open-set Multimodal 3D Mapping | Feb 14, 2023 | 3D geometryAutonomous Driving | CodeCode Available | 2 |
| Translating Natural Language to Planning Goals with Large-Language Models | Feb 10, 2023 | Spatial ReasoningTranslation | CodeCode Available | 1 |
| Ego-Humans: An Ego-Centric 3D Multi-Human Benchmark | Jan 1, 2023 | 3D Pose EstimationHuman Detection | —Unverified | 0 |
| Are Deep Neural Networks SMARTer than Second Graders? | Dec 20, 2022 | Language ModellingMeta-Learning | CodeCode Available | 1 |
| OpenD: A Benchmark for Language-Driven Door and Drawer Opening | Dec 10, 2022 | Spatial Reasoning | —Unverified | 0 |
| Location-Aware Self-Supervised Transformers for Semantic Segmentation | Dec 5, 2022 | Contrastive Learningimage-classification | —Unverified | 0 |
| Spatial Reasoning for Few-Shot Object Detection | Nov 2, 2022 | Data AugmentationFew-Shot Object Detection | —Unverified | 0 |
| A Symbolic Representation of Human Posture for Interpretable Learning and Reasoning | Oct 17, 2022 | Activity RecognitionSpatial Reasoning | —Unverified | 0 |
| LOViS: Learning Orientation and Visual Signals for Vision and Language Navigation | Sep 26, 2022 | Spatial ReasoningVision and Language Navigation | CodeCode Available | 0 |
| Toward 3D Spatial Reasoning for Human-like Text-based Visual Question Answering | Sep 21, 2022 | Image CaptioningOptical Character Recognition (OCR) | —Unverified | 0 |
| CASPER: Cognitive Architecture for Social Perception and Engagement in Robots | Sep 1, 2022 | Action RecognitionNavigate | —Unverified | 0 |
| Knowing Earlier what Right Means to You: A Comprehensive VQA Dataset for Grounding Relative Directions via Multi-Task Learning | Jul 6, 2022 | DiagnosticMulti-Task Learning | CodeCode Available | 0 |
| Translating Place-Related Questions to GeoSPARQL Queries | May 6, 2022 | Geographic Question AnsweringQuestion Answering | CodeCode Available | 0 |
| Explicit Object Relation Alignment for Vision and Language Navigation | May 1, 2022 | ObjectRelation | CodeCode Available | 0 |
| Visual Spatial Reasoning | Apr 30, 2022 | Spatial Reasoning | CodeCode Available | 1 |
| StepGame: A New Benchmark for Robust Multi-Hop Spatial Reasoning in Texts | Apr 18, 2022 | Question AnsweringSpatial Reasoning | CodeCode Available | 1 |
| ReCLIP: A Strong Zero-Shot Baseline for Referring Expression Comprehension | Apr 12, 2022 | image-classificationImage Classification | CodeCode Available | 1 |
| Capturing Shape Information with Multi-Scale Topological Loss Terms for 3D Reconstruction | Mar 3, 2022 | 3D ReconstructionSpatial Reasoning | CodeCode Available | 1 |
| DeepSSN: a deep convolutional neural network to assess spatial scene similarity | Feb 7, 2022 | Data AugmentationInformation Retrieval | CodeCode Available | 0 |
| ReCLIP: A Strong Zero-Shot Baseline for Referring Expression Comprehension | Nov 16, 2021 | image-classificationImage Classification | —Unverified | 0 |
| Explicit Object Relation Alignment for Vision and Language Navigation | Nov 16, 2021 | Instruction FollowingRelation | —Unverified | 0 |
| Graph Relation Transformer: Incorporating pairwise object features into the Transformer architecture | Nov 11, 2021 | Graph AttentionQuestion Answering | —Unverified | 0 |
| Revisiting spatio-temporal layouts for compositional action recognition | Nov 2, 2021 | Action ClassificationAction Detection | CodeCode Available | 1 |
| IndoNLI: A Natural Language Inference Dataset for Indonesian | Oct 27, 2021 | Natural Language InferenceSentence | CodeCode Available | 1 |
| Unsupervised Representation Learning Facilitates Human-like Spatial Reasoning | Oct 12, 2021 | Representation LearningSpatial Reasoning | —Unverified | 0 |
| CLIPort: What and Where Pathways for Robotic Manipulation | Sep 24, 2021 | Imitation LearningRobotic Grasping | CodeCode Available | 1 |
| Grounding Natural Language Instructions: Can Large Language Models Capture Spatial Information? | Sep 17, 2021 | Spatial Reasoning | CodeCode Available | 0 |
| SORNet: Spatial Object-Centric Representations for Sequential Manipulation | Sep 8, 2021 | ObjectRelation Classification | CodeCode Available | 0 |
| Weakly Supervised Relative Spatial Reasoning for Visual Question Answering | Sep 4, 2021 | Question AnsweringSpatial Reasoning | CodeCode Available | 0 |
| Teaching Agents how to Map: Spatial Reasoning for Multi-Object Navigation | Jul 13, 2021 | Reinforcement Learning (RL)Spatial Reasoning | CodeCode Available | 1 |
| LanguageRefer: Spatial-Language Model for 3D Visual Grounding | Jul 7, 2021 | 3D visual groundingLanguage Modeling | —Unverified | 0 |
| SPARTQA: A Textual Question Answering Benchmark for Spatial Reasoning | Jun 1, 2021 | Question AnsweringSpatial Reasoning | CodeCode Available | 1 |
| SBEVNet: End-to-End Deep Stereo Layout Estimation | May 25, 2021 | Depth EstimationDisparity Estimation | CodeCode Available | 1 |
| Towards Navigation by Reasoning over Spatial Configurations | May 14, 2021 | Spatial Reasoning | —Unverified | 0 |
| Self-supervised Spatial Reasoning on Multi-View Line Drawings | Apr 27, 2021 | Binary ClassificationContrastive Learning | CodeCode Available | 1 |
| A Self-Supervised Auxiliary Loss for Deep RL in Partially Observable Settings | Apr 17, 2021 | NavigateSpatial Reasoning | —Unverified | 0 |
| Global Information Guided Video Anomaly Detection | Apr 14, 2021 | Anomaly DetectionSpatial Reasoning | —Unverified | 0 |
| SpartQA: : A Textual Question Answering Benchmark for Spatial Reasoning | Apr 12, 2021 | Question AnsweringSpatial Reasoning | CodeCode Available | 1 |
| Commonsense Spatial Reasoning for Visually Intelligent Agents | Apr 1, 2021 | Spatial Reasoning | —Unverified | 0 |
| Stride and Translation Invariance in CNNs | Mar 18, 2021 | Data Augmentationimage-classification | —Unverified | 0 |
| End-to-End Egospheric Spatial Memory | Feb 15, 2021 | General Reinforcement LearningImitation Learning | CodeCode Available | 1 |
| Multi-scale GCN-assisted two-stage network for joint segmentation of retinal layers and disc in peripapillary OCT images | Feb 9, 2021 | DecoderMedical Image Segmentation | CodeCode Available | 1 |
| Grounding Consistency: Distilling Spatial Common Sense for Precise Visual Relationship Detection | Jan 1, 2021 | Common Sense ReasoningGraph Generation | CodeCode Available | 1 |
| Long Range Arena : A Benchmark for Efficient Transformers | Jan 1, 2021 | 16kBenchmarking | —Unverified | 0 |
| Ego-Centric Spatial Memory Networks | Jan 1, 2021 | CPUGPU | —Unverified | 0 |
| Commonsense Visual Sensemaking for Autonomous Driving: On Generalised Neurosymbolic Online Abduction Integrating Vision and Semantics | Dec 28, 2020 | Autonomous DrivingQuestion Answering | —Unverified | 0 |