| Making History Matter: History-Advantage Sequence Training for Visual Dialog | Feb 25, 2019 | Answer GenerationDecoder | —Unverified | 0 |
| Can We Automate Diagrammatic Reasoning? | Feb 13, 2019 | Visual Reasoning | —Unverified | 0 |
| When Causal Intervention Meets Adversarial Examples and Image Masking for Deep Neural Networks | Feb 9, 2019 | Causal InferenceVisual Reasoning | CodeCode Available | 0 |
| Visual Entailment: A Novel Task for Fine-Grained Image Understanding | Jan 20, 2019 | Natural Language InferenceQuestion Answering | —Unverified | 0 |
| Visual Reasoning of Feature Attribution with Deep Recurrent Neural Networks | Jan 17, 2019 | ClassificationGeneral Classification | —Unverified | 0 |
| CLEVR-Ref+: Diagnosing Visual Reasoning with Referring Expressions | Jan 3, 2019 | DiagnosticImage Segmentation | CodeCode Available | 0 |
| Spatial Knowledge Distillation to aid Visual Reasoning | Dec 10, 2018 | DiagnosticKnowledge Distillation | —Unverified | 0 |
| Learning to Assemble Neural Module Tree Networks for Visual Grounding | Dec 8, 2018 | Dependency ParsingNatural Language Visual Grounding | —Unverified | 0 |
| Explainable and Explicit Visual Reasoning over Scene Graphs | Dec 5, 2018 | Inductive BiasVisual Question Answering (VQA) | CodeCode Available | 0 |
| Learning to Compose Dynamic Tree Structures for Visual Contexts | Dec 5, 2018 | Graph GenerationPanoptic Scene Graph Generation | CodeCode Available | 2 |
| A Corpus for Reasoning About Natural Language Grounded in Photographs | Nov 1, 2018 | DiversityVisual Reasoning | CodeCode Available | 0 |
| Cascaded Mutual Modulation for Visual Reasoning | Sep 6, 2018 | Question AnsweringVisual Question Answering | CodeCode Available | 0 |
| Mapping Natural Language Commands to Web Elements | Aug 28, 2018 | Relational ReasoningVisual Reasoning | CodeCode Available | 0 |
| Visual Reasoning with Multi-hop Feature Modulation | Aug 3, 2018 | Question AnsweringVisual Dialog | CodeCode Available | 0 |
| Weakly Supervised Semantic Parsing with Abstract Examples | Jul 1, 2018 | Semantic ParsingVisual Reasoning | —Unverified | 0 |
| Modularity Matters: Learning Invariant Relational Reasoning Tasks | Jun 18, 2018 | Mixture-of-ExpertsRelational Reasoning | —Unverified | 0 |
| Object Level Visual Reasoning in Videos | Jun 16, 2018 | Activity RecognitionHuman Activity Recognition | CodeCode Available | 0 |
| Visual Reasoning by Progressive Module Networks | Jun 6, 2018 | Visual Reasoning | CodeCode Available | 0 |
| Lexical Conceptual Structure of Literal and Metaphorical Spatial Language: A Case Study of ``Push'' | Jun 1, 2018 | Machine TranslationTranslation | —Unverified | 0 |
| Visual Choice of Plausible Alternatives: An Evaluation of Image-based Commonsense Causal Reasoning | May 1, 2018 | Commonsense Causal ReasoningImage Captioning | CodeCode Available | 0 |
| Object Ordering with Bidirectional Matchings for Visual Reasoning | Apr 18, 2018 | ObjectVisual Reasoning | —Unverified | 0 |
| Iterative Visual Reasoning Beyond Convolutions | Mar 29, 2018 | Visual Reasoning | —Unverified | 0 |
| A Dataset and Architecture for Visual Reasoning with a Working Memory | Mar 16, 2018 | DiagnosticLogical Reasoning | CodeCode Available | 0 |
| Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning | Mar 14, 2018 | Question AnsweringVisual Question Answering | CodeCode Available | 0 |
| Compositional Attention Networks for Machine Reasoning | Mar 8, 2018 | Referring Expression ComprehensionVisual Question Answering (VQA) | CodeCode Available | 1 |
| Same-different problems strain convolutional neural networks | Feb 9, 2018 | MemorizationVisual Reasoning | —Unverified | 0 |
| Benchmark Visual Question Answer Models by using Focus Map | Jan 13, 2018 | Visual Reasoning | —Unverified | 0 |
| Not-So-CLEVR: Visual Relations Strain Feedforward Neural Networks | Jan 1, 2018 | MemorizationQuestion Answering | —Unverified | 0 |
| Learning to Act Properly: Predicting and Explaining Affordances from Images | Dec 20, 2017 | Visual Reasoning | —Unverified | 0 |
| Multi-Label Zero-Shot Learning with Structured Knowledge Graphs | Nov 17, 2017 | General ClassificationKnowledge Graphs | CodeCode Available | 0 |
| Weakly-supervised Semantic Parsing with Abstract Examples | Nov 14, 2017 | Semantic ParsingVisual Reasoning | CodeCode Available | 0 |
| Complete 3D Scene Parsing from an RGBD Image | Oct 25, 2017 | DiversityRetrieval | CodeCode Available | 0 |
| FigureQA: An Annotated Figure Dataset for Visual Reasoning | Oct 19, 2017 | BIG-bench Machine LearningChart Question Answering | CodeCode Available | 0 |
| Visual Reasoning with Natural Language | Oct 2, 2017 | DescriptiveDiversity | —Unverified | 0 |
| FiLM: Visual Reasoning with a General Conditioning Layer | Sep 22, 2017 | Image Retrieval with Multi-Modal QueryVisual Question Answering (VQA) | CodeCode Available | 1 |
| VSE++: Improving Visual-Semantic Embeddings with Hard Negatives | Jul 18, 2017 | Cross-Modal RetrievalImage Retrieval | CodeCode Available | 1 |
| Learning Visual Reasoning Without Strong Priors | Jul 10, 2017 | Visual Reasoning | CodeCode Available | 0 |
| End-to-End Learning of Semantic Grasping | Jul 6, 2017 | Objectobject-detection | —Unverified | 0 |
| A Corpus of Natural Language for Visual Reasoning | Jul 1, 2017 | Question AnsweringVisual Question Answering (VQA) | —Unverified | 0 |
| How a General-Purpose Commonsense Ontology can Improve Performance of Learning-Based Image Retrieval | May 24, 2017 | Image RetrievalRetrieval | CodeCode Available | 0 |
| Inferring and Executing Programs for Visual Reasoning | May 10, 2017 | Visual Question Answering (VQA)Visual Reasoning | CodeCode Available | 0 |
| EgoReID: Cross-view Self-Identification and Human Re-identification in Egocentric and Surveillance Videos | Dec 24, 2016 | Person Re-IdentificationVisual Reasoning | —Unverified | 0 |
| CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning | Dec 20, 2016 | DiagnosticQuestion Answering | CodeCode Available | 1 |
| Dual Local-Global Contextual Pathways for Recognition in Aerial Imagery | May 18, 2016 | Object RecognitionRoad Segmentation | —Unverified | 0 |
| Filling in the details: Perceiving from low fidelity images | Apr 14, 2016 | FoveationVisual Reasoning | —Unverified | 0 |
| Are Elephants Bigger than Butterflies? Reasoning about Sizes of Objects | Feb 2, 2016 | Visual Reasoning | —Unverified | 0 |
| Predicting Complete 3D Models of Indoor Scenes | Apr 9, 2015 | DiversityVisual Reasoning | CodeCode Available | 0 |
| Factorization of View-Object Manifolds for Joint Object Recognition and Pose Estimation | Mar 23, 2015 | ObjectObject Recognition | —Unverified | 0 |