Recurrent Models of Visual Attention
Volodymyr Mnih, Nicolas Heess, Alex Graves, Koray Kavukcuoglu
Code
- github.com/abdosharaf98/active-sensing-paper (pytorch) ★ 10
- github.com/MiuGod0126/RAM-Paddle (paddle) ★ 0
- github.com/ulstu/robotics_ml ★ 0
- github.com/amobiny/Recurrent_Attention_Model (tf) ★ 0
- github.com/conan7882/recurrent-attention-model (tf) ★ 0
- github.com/Sooram/cnn (tf) ★ 0
- github.com/kevinzakka/recurrent-visual-attention (pytorch) ★ 0
- github.com/johnrobinsn/catch (pytorch) ★ 0
- github.com/mengdi-li/robotic-occlusion-reasoning (pytorch) ★ 0
- github.com/tianyu-tristan/Visual-Attention-Model (tf) ★ 0
Abstract
Applying convolutional neural networks to large images is computationally expensive because the amount of computation scales linearly with the number of image pixels. We present a novel recurrent neural network model that is capable of extracting information from an image or video by adaptively selecting a sequence of regions or locations and only processing the selected regions at high resolution. Like convolutional neural networks, the proposed model has a degree of translation invariance built-in, but the amount of computation it performs can be controlled independently of the input image size. While the model is non-differentiable, it can be trained using reinforcement learning methods to learn task-specific policies. We evaluate our model on several image classification tasks, where it significantly outperforms a convolutional neural network baseline on cluttered images, and on a dynamic visual control problem, where it learns to track a simple object without an explicit training signal for doing so.
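The core idea of processing only selected regions at high resolution is implemented in the paper by a retina-like glimpse sensor: at each step the model crops several concentric patches around the chosen location, with coarser scales covering progressively larger areas at lower resolution. The sketch below illustrates that glimpse extraction in NumPy; the function name, parameters, and the use of simple block-average pooling are our own illustrative choices, not taken from the paper (which also pairs this sensor with an RNN core and a REINFORCE-trained location policy, omitted here).

```python
import numpy as np

def extract_glimpse(image, center, size=4, n_scales=2):
    """Retina-like glimpse sketch: crop n_scales square patches centered
    at `center`, each twice the width of the previous one, and pool every
    patch down to `size` x `size`, so coarser scales see a wider but
    lower-resolution view. (Hypothetical helper, not the paper's code.)"""
    cy, cx = center
    patches = []
    for k in range(n_scales):
        half = size * (2 ** k) // 2
        # Zero-pad so crops near the image border stay square.
        padded = np.pad(image, half, mode="constant")
        # In padded coordinates the center shifts by `half`, so this
        # slice is the 2*half x 2*half window around (cy, cx).
        patch = padded[cy : cy + 2 * half, cx : cx + 2 * half]
        # Block-average pooling down to size x size.
        f = patch.shape[0] // size
        pooled = patch.reshape(size, f, size, f).mean(axis=(1, 3))
        patches.append(pooled)
    return np.stack(patches)  # shape: (n_scales, size, size)
```

Note that the cost of a glimpse depends only on `size` and `n_scales`, not on the image dimensions, which is how the model's computation can be controlled independently of input size.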