Value Iteration Networks
Aviv Tamar, Yi Wu, Garrett Thomas, Sergey Levine, Pieter Abbeel
Code
- github.com/avivt/VIN (official; code from the paper, PyTorch)
- github.com/zuoxingdong/VIN_PyTorch_Visdom (PyTorch)
- github.com/sufengniu/GVIN (TensorFlow)
- github.com/TheAbhiKumar/tensorflow-value-iteration-networks (TensorFlow)
- github.com/Dungyichao/Electric-Vehicle-Route-Planning-on-Google-Map-Reinforcement-Learning (TensorFlow)
- github.com/onlytailei/Value-Iteration-Networks-PyTorch (PyTorch)
- github.com/LiorAl/GymValueIterationNetworks (PyTorch)
- github.com/kentsommer/pytorch-value-iteration-networks (PyTorch)
Abstract
We introduce the value iteration network (VIN): a fully differentiable neural network with a 'planning module' embedded within. VINs can learn to plan, and are suitable for predicting outcomes that involve planning-based reasoning, such as policies for reinforcement learning. Key to our approach is a novel differentiable approximation of the value-iteration algorithm, which can be represented as a convolutional neural network and trained end-to-end using standard backpropagation. We evaluate VIN-based policies on discrete and continuous path-planning domains, and on a natural-language-based search task. We show that by learning an explicit planning computation, VIN policies generalize better to new, unseen domains.
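The core idea in the abstract is that one step of value iteration, V_{n+1}(s) = max_a [R(s,a) + γ Σ_{s'} P(s'|s,a) V_n(s')], can be expressed on a 2D grid as a convolution (producing one Q-channel per action) followed by a max over the action channel. Below is a minimal NumPy sketch of such a VI module under that interpretation; the kernel weights, grid size, and action count are illustrative stand-ins (in an actual VIN these kernels are learned by backpropagation, and the convolutions would use a deep-learning framework):

```python
import numpy as np

def value_iteration_module(reward, num_actions=4, k=20, gamma=0.99, seed=0):
    """Sketch of a VI module: k steps of value iteration expressed as a
    convolutional Q-update followed by a max over the action channel.
    The 3x3 kernels below are random stand-ins; in a VIN they are learned."""
    rng = np.random.default_rng(seed)
    h, w = reward.shape
    # One 3x3 kernel per action applied to the reward map, and one applied
    # to the current value map (playing the role of transition probabilities).
    w_r = rng.normal(scale=0.1, size=(num_actions, 3, 3))
    w_v = rng.normal(scale=0.1, size=(num_actions, 3, 3))
    v = np.zeros((h, w))
    padded_r = np.pad(reward, 1)
    for _ in range(k):
        padded_v = np.pad(v, 1)
        q = np.empty((num_actions, h, w))
        for a in range(num_actions):
            # 2-D cross-correlation with 'same' padding: each Q-channel is a
            # weighted sum of the local reward and discounted value patches.
            q[a] = sum(
                w_r[a, i, j] * padded_r[i:i + h, j:j + w]
                + gamma * w_v[a, i, j] * padded_v[i:i + h, j:j + w]
                for i in range(3) for j in range(3)
            )
        v = q.max(axis=0)  # max over the action channel
    return v

# Example: an 8x8 grid with a single rewarding cell in the corner.
reward = np.zeros((8, 8))
reward[7, 7] = 1.0
v = value_iteration_module(reward, k=10)
```

Because every operation here (convolution, channel-wise max) is differentiable almost everywhere, gradients can flow through all k iterations, which is what lets the planning computation be trained end-to-end.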