Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering Jul 25, 2017 Image Captioning Visual Question Answering
Code Code Available 1Improved Bilinear Pooling with CNNs Jul 21, 2017 GPU Question Answering
— Unverified 0Video Question Answering via Attribute-Augmented Attention Network Learning Jul 20, 2017 Attribute Information Retrieval
— Unverified 0Visual Question Answering with Memory-Augmented Networks Jul 17, 2017 Question Answering Visual Question Answering
— Unverified 0Effective Approaches to Batch Parallelization for Dynamic Neural Network Architectures Jul 8, 2017 Mixture-of-Experts Question Answering
Code Code Available 0Modulating early visual processing by language Jul 2, 2017 Question Answering Visual Question Answering
Code Code Available 0Multi-Level Attention Networks for Visual Question Answering Jul 1, 2017 Question Answering Visual Question Answering
— Unverified 0Are You Smarter Than a Sixth Grader? Textbook Question Answering for Multimodal Machine Comprehension Jul 1, 2017 Question Answering Reading Comprehension
— Unverified 0Kernel Pooling for Convolutional Neural Networks Jul 1, 2017 Face Recognition Fine-Grained Visual Categorization
— Unverified 0Knowledge Acquisition for Visual Question Answering via Iterative Querying Jul 1, 2017 Question Answering Visual Question Answering
— Unverified 0Segmentation Guided Attention Networks for Visual Question Answering Jul 1, 2017 Common Sense Reasoning Question Answering
— Unverified 0Multimodal Machine Learning: Integrating Language, Vision and Speech Jul 1, 2017 Audio-Visual Speech Recognition BIG-bench Machine Learning
— Unverified 0A Corpus of Natural Language for Visual Reasoning Jul 1, 2017 Question Answering Visual Question Answering (VQA)
— Unverified 0Compact Tensor Pooling for Visual Question Answering Jun 20, 2017 Question Answering Visual Question Answering
— Unverified 0A simple neural network module for relational reasoning Jun 5, 2017 Image Retrieval with Multi-Modal Query Question Answering
Code Code Available 0Deep learning evaluation using deep linguistic processing Jun 5, 2017 Deep Learning Multimodal Deep Learning
— Unverified 0MUTAN: Multimodal Tucker Fusion for Visual Question Answering May 18, 2017 Visual Question Answering Visual Question Answering (VQA)
Code Code Available 0ParlAI: A Dialog Research Software Platform May 18, 2017 reinforcement-learning Reinforcement Learning
Code Code Available 1Learning Convolutional Text Representations for Visual Question Answering May 18, 2017 General Classification image-classification
Code Code Available 0Survey of Visual Question Answering: Datasets and Techniques May 10, 2017 Deep Learning Question Answering
— Unverified 0Inferring and Executing Programs for Visual Reasoning May 10, 2017 Visual Question Answering (VQA) Visual Reasoning
Code Code Available 0The Forgettable-Watcher Model for Video Question Answering May 3, 2017 model Question Answering
— Unverified 0The Promise of Premise: Harnessing Question Premises in Visual Question Answering May 1, 2017 Question Answering Relevance Detection
Code Code Available 0Speech-Based Visual Question Answering May 1, 2017 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0C-VQA: A Compositional Split of the Visual Question Answering (VQA) v1.0 Dataset Apr 26, 2017 Question Answering Visual Question Answering
— Unverified 0Being Negative but Constructively: Lessons Learnt from Creating Better Visual Question Answering Datasets Apr 24, 2017 Multiple-choice Question Answering
— Unverified 0Learning to Reason: End-to-End Module Networks for Visual Question Answering Apr 18, 2017 Visual Dialog Visual Question Answering
Code Code Available 0ShapeWorld - A new test methodology for multimodal language understanding Apr 14, 2017 Multimodal Deep Learning Visual Question Answering
Code Code Available 0TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question Answering Apr 14, 2017 Question Answering Visual Question Answering
Code Code Available 0What's in a Question: Using Visual Questions as a Form of Supervision Apr 12, 2017 Data Augmentation Form
Code Code Available 0Show, Ask, Attend, and Answer: A Strong Baseline For Visual Question Answering Apr 11, 2017 Visual Question Answering Visual Question Answering (VQA)
Code Code Available 0An Empirical Evaluation of Visual Question Answering for Novel Objects Apr 8, 2017 Question Answering Visual Question Answering
— Unverified 0It Takes Two to Tango: Towards Theory of AI's Mind Apr 3, 2017 Attribute Question Answering
— Unverified 0Aligned Image-Word Representations Improve Inductive Transfer Across Vision-Language Tasks Apr 2, 2017 Multi-Task Learning Question Answering
— Unverified 0An Analysis of Visual Question Answering Algorithms Mar 28, 2017 Question Answering Visual Question Answering
— Unverified 0Recurrent and Contextual Models for Visual Question Answering Mar 23, 2017 Diversity Multiple-choice
— Unverified 0Multimodal Compact Bilinear Pooling for Multimodal Neural Machine Translation Mar 23, 2017 Decoder Machine Translation
— Unverified 0Learning Cooperative Visual Dialog Agents with Deep Reinforcement Learning Mar 20, 2017 Deep Reinforcement Learning reinforcement-learning
Code Code Available 1VQABQ: Visual Question Answering by Basic Questions Mar 19, 2017 Question Answering Visual Question Answering
— Unverified 0End-to-end optimization of goal-driven and visually grounded dialogue systems Mar 15, 2017 Decoder Deep Reinforcement Learning
Code Code Available 0Tree Memory Networks for Modelling Long-term Temporal Dependencies Mar 12, 2017 Machine Translation Part-Of-Speech Tagging
— Unverified 0Task-driven Visual Saliency and Attention-based Visual Question Answering Feb 22, 2017 Question Answering Visual Question Answering
— Unverified 0Vision and Language Integration: Moving beyond Objects Jan 1, 2017 Action Classification Image Captioning
— Unverified 0CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning Dec 20, 2016 Diagnostic Question Answering
Code Code Available 1The VQA-Machine: Learning How to Use Existing Vision Algorithms to Answer New Questions Dec 16, 2016 BIG-bench Machine Learning Question Answering
— Unverified 0Attentive Explanations: Justifying Decisions and Pointing to the Evidence Dec 14, 2016 Decision Making Question Answering
— Unverified 0VIBIKNet: Visual Bidirectional Kernelized Network for Visual Question Answering Dec 12, 2016 Question Answering Visual Question Answering
Code Code Available 0Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering Dec 2, 2016 Visual Question Answering Visual Question Answering (VQA)
Code Code Available 0Visual Question Answering with Question Representation Update (QRU) Dec 1, 2016 Question Answering Visual Question Answering
— Unverified 0The Development of Multimodal Lexical Resources Dec 1, 2016 Question Answering Visual Question Answering (VQA)
— Unverified 0