| Stacked Latent Attention for Multimodal Reasoning | Jun 1, 2018 | Image CaptioningMultimodal Reasoning | —Unverified | 0 |
| Categorizing Concepts With Basic Level for Vision-to-Language | Jun 1, 2018 | ClusteringImage Captioning | —Unverified | 0 |
| Stacking with Auxiliary Features for Visual Question Answering | Jun 1, 2018 | Common Sense ReasoningQuestion Answering | —Unverified | 0 |
| Hyperbolic Attention Networks | May 24, 2018 | Machine TranslationQuestion Answering | —Unverified | 0 |
| Reproducibility Report for "Learning To Count Objects In Natural Images For Visual Question Answering" | May 21, 2018 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Did the Model Understand the Question? | May 14, 2018 | modelQuestion Answering | CodeCode Available | 0 |
| Reciprocal Attention Fusion for Visual Question Answering | May 11, 2018 | ObjectQuestion Answering | —Unverified | 0 |
| Large Scale Scene Text Verification with Guided Attention | Apr 23, 2018 | Question AnsweringScene Text Detection | —Unverified | 0 |
| Question Type Guided Attention in Visual Question Answering | Apr 6, 2018 | Activity RecognitionQuestion Answering | —Unverified | 0 |
| Improved Fusion of Visual and Language Representations by Dense Symmetric Co-Attention for Visual Question Answering | Apr 3, 2018 | Visual Question AnsweringVisual Question Answering (VQA) | CodeCode Available | 0 |
| Differential Attention for Visual Question Answering | Apr 1, 2018 | Question AnsweringVisual Question Answering | CodeCode Available | 0 |
| Visual Question Reasoning on General Dependency Tree | Mar 31, 2018 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| DDRprog: A CLEVR Differentiable Dynamic Reasoning Programmer | Mar 30, 2018 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Two can play this Game: Visual Dialog with Discriminative Question Generation and Answering | Mar 29, 2018 | Image CaptioningQuestion Answering | —Unverified | 0 |
| Generalized Hadamard-Product Fusion Operators for Visual Question Answering | Mar 26, 2018 | Neural Architecture SearchQuestion Answering | —Unverified | 0 |
| Explicit Reasoning over End-to-End Neural Architectures for Visual Question Answering | Mar 23, 2018 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Attention on Attention: Architectures for Visual Question Answering (VQA) | Mar 21, 2018 | GPUQuestion Answering | CodeCode Available | 0 |
| VQA-E: Explaining, Elaborating, and Enhancing Your Answers for Visual Questions | Mar 20, 2018 | Explanatory Visual Question AnsweringMulti-Task Learning | —Unverified | 0 |
| Inverse Visual Question Answering: A New Benchmark and VQA Diagnosis Tool | Mar 16, 2018 | Question AnsweringReinforcement Learning | —Unverified | 0 |
| Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning | Mar 14, 2018 | Question AnsweringVisual Question Answering | CodeCode Available | 0 |
| VizWiz Grand Challenge: Answering Visual Questions from Blind People | Feb 22, 2018 | Question AnsweringVisual Question Answering | CodeCode Available | 0 |
| Multimodal Explanations: Justifying Decisions and Pointing to the Evidence | Feb 15, 2018 | Activity RecognitionExplainable Models | CodeCode Available | 0 |
| Learning to Count Objects in Natural Images for Visual Question Answering | Feb 15, 2018 | Visual Question AnsweringVisual Question Answering (VQA) | CodeCode Available | 0 |
| Generating Triples with Adversarial Networks for Scene Graph Construction | Feb 7, 2018 | Attributegraph construction | —Unverified | 0 |
| Dual Recurrent Attention Units for Visual Question Answering | Feb 1, 2018 | Question AnsweringVisual Question Answering | CodeCode Available | 0 |