
VIBIKNet: Visual Bidirectional Kernelized Network for Visual Question Answering

2016-12-12

Marc Bolaños, Álvaro Peris, Francisco Casacuberta, Petia Radeva


Abstract

In this paper, we address the problem of visual question answering by proposing a novel model, called VIBIKNet. Our model is based on integrating Kernelized Convolutional Neural Networks and Long Short-Term Memory units to generate an answer given a question about an image. We prove that VIBIKNet is an optimal trade-off between accuracy and computational load, in terms of memory and time consumption. We validate our method on the VQA challenge dataset and compare it to the top-performing methods in order to illustrate its performance and speed.
