Visual Question Answering with Question Representation Update (QRU)

2016-12-01NeurIPS 2016Unverified0· sign in to hype

Ruiyu Li, Jiaya Jia

Unverified — Be the first to reproduce this paper.

Abstract

Our method aims at reasoning over natural language questions and visual images. Given a natural language question about an image, our model updates the question representation iteratively by selecting image regions relevant to the query and learns to give the correct answer. Our model contains several reasoning layers, exploiting complex visual relations in the visual question answering (VQA) task. The proposed network is end-to-end trainable through back-propagation, where its weights are initialized using pre-trained convolutional neural network (CNN) and gated recurrent unit (GRU). Our method is evaluated on challenging datasets of COCO-QA and VQA and yields state-of-the-art performance.

Tasks

Question Answering Visual Question Answering Visual Question Answering (VQA)

Visual Question Answering with Question Representation Update (QRU)

Abstract

Tasks

Reproductions