| Deep learning evaluation using deep linguistic processing | Jun 5, 2017 | Deep Learning, Multimodal Deep Learning | Unverified | 0 |
| A simple neural network module for relational reasoning | Jun 5, 2017 | Image Retrieval with Multi-Modal Query, Question Answering | Code Available | 0 |
| MUTAN: Multimodal Tucker Fusion for Visual Question Answering | May 18, 2017 | Visual Question Answering, Visual Question Answering (VQA) | Code Available | 0 |
| Learning Convolutional Text Representations for Visual Question Answering | May 18, 2017 | General Classification, image-classification | Code Available | 0 |
| Survey of Visual Question Answering: Datasets and Techniques | May 10, 2017 | Deep Learning, Question Answering | Unverified | 0 |
| The Forgettable-Watcher Model for Video Question Answering | May 3, 2017 | model, Question Answering | Unverified | 0 |
| Speech-Based Visual Question Answering | May 1, 2017 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| The Promise of Premise: Harnessing Question Premises in Visual Question Answering | May 1, 2017 | Question Answering, Relevance Detection | Code Available | 0 |
| C-VQA: A Compositional Split of the Visual Question Answering (VQA) v1.0 Dataset | Apr 26, 2017 | Question Answering, Visual Question Answering | Unverified | 0 |
| Being Negative but Constructively: Lessons Learnt from Creating Better Visual Question Answering Datasets | Apr 24, 2017 | Multiple-choice, Question Answering | Unverified | 0 |
| Learning to Reason: End-to-End Module Networks for Visual Question Answering | Apr 18, 2017 | Visual Dialog, Visual Question Answering | Code Available | 0 |
| TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question Answering | Apr 14, 2017 | Question Answering, Visual Question Answering | Code Available | 0 |
| ShapeWorld - A new test methodology for multimodal language understanding | Apr 14, 2017 | Multimodal Deep Learning, Visual Question Answering | Code Available | 0 |
| What's in a Question: Using Visual Questions as a Form of Supervision | Apr 12, 2017 | Data Augmentation, Form | Code Available | 0 |
| Show, Ask, Attend, and Answer: A Strong Baseline For Visual Question Answering | Apr 11, 2017 | Visual Question Answering, Visual Question Answering (VQA) | Code Available | 0 |
| An Empirical Evaluation of Visual Question Answering for Novel Objects | Apr 8, 2017 | Question Answering, Visual Question Answering | Unverified | 0 |
| It Takes Two to Tango: Towards Theory of AI's Mind | Apr 3, 2017 | Attribute, Question Answering | Unverified | 0 |
| Aligned Image-Word Representations Improve Inductive Transfer Across Vision-Language Tasks | Apr 2, 2017 | Multi-Task Learning, Question Answering | Unverified | 0 |
| An Analysis of Visual Question Answering Algorithms | Mar 28, 2017 | Question Answering, Visual Question Answering | Unverified | 0 |
| Recurrent and Contextual Models for Visual Question Answering | Mar 23, 2017 | Diversity, Multiple-choice | Unverified | 0 |
| Multimodal Compact Bilinear Pooling for Multimodal Neural Machine Translation | Mar 23, 2017 | Decoder, Machine Translation | Unverified | 0 |
| VQABQ: Visual Question Answering by Basic Questions | Mar 19, 2017 | Question Answering, Visual Question Answering | Unverified | 0 |
| Tree Memory Networks for Modelling Long-term Temporal Dependencies | Mar 12, 2017 | Machine Translation, Part-Of-Speech Tagging | Unverified | 0 |
| Task-driven Visual Saliency and Attention-based Visual Question Answering | Feb 22, 2017 | Question Answering, Visual Question Answering | Unverified | 0 |
| The VQA-Machine: Learning How to Use Existing Vision Algorithms to Answer New Questions | Dec 16, 2016 | BIG-bench Machine Learning, Question Answering | Unverified | 0 |