Obstacle Tower: A Generalization Challenge in Vision, Control, and Planning

2019-02-04

Arthur Juliani, Ahmed Khalifa, Vincent-Pierre Berges, Jonathan Harper, Ervin Teng, Hunter Henry, Adam Crespi, Julian Togelius, Danny Lange

Code

github.com/Unity-Technologies/obstacle-tower-env (official, in paper)
github.com/dazcona/obstacletower
github.com/odokumaci/rainbow-unity-obstacle-tower-challenge

Abstract

The rapid pace of recent research in AI has been driven in part by the presence of fast and challenging simulation environments. These environments often take the form of games, with tasks ranging from simple board games to competitive video games. We propose a new benchmark, Obstacle Tower: a high-fidelity, 3D, third-person, procedurally generated environment. An agent playing Obstacle Tower must learn to solve both low-level control and high-level planning problems in tandem while learning from pixels and a sparse reward signal. Unlike other benchmarks such as the Arcade Learning Environment, evaluation of agent performance in Obstacle Tower is based on an agent's ability to perform well on unseen instances of the environment. In this paper, we outline the environment and provide a set of baseline results produced by current state-of-the-art deep RL methods as well as human players. These algorithms fail to produce agents capable of performing near human level.
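The idea of scoring an agent on unseen instances can be illustrated with a small sketch. It does not use the Obstacle Tower environment or its API: `ToyTower`, `evaluate`, and `memorizing_policy` are hypothetical names invented for this illustration, and the seed ranges are arbitrary. The sketch only shows why a held-out split matters: an agent that memorizes its training instances scores perfectly on them, which says nothing about how it does on new ones.

```python
import random
from statistics import mean


class ToyTower:
    """Hypothetical stand-in for a procedurally generated environment.

    Each seed yields a different hidden sequence of correct actions,
    one per floor. This is NOT the Obstacle Tower API.
    """

    def __init__(self, seed, num_floors=5, num_actions=3):
        rng = random.Random(seed)
        self.solution = [rng.randrange(num_actions) for _ in range(num_floors)]
        self.floor = 0

    def step(self, action):
        """Return (reward, done). Sparse reward: +1 only when a floor is cleared."""
        if action == self.solution[self.floor]:
            self.floor += 1
            return 1.0, self.floor == len(self.solution)
        return 0.0, True  # a wrong action ends the episode


def evaluate(policy, seeds):
    """Mean number of floors cleared per episode over the given seeds."""
    scores = []
    for seed in seeds:
        env = ToyTower(seed)
        total, done = 0.0, False
        while not done:
            reward, done = env.step(policy(seed, env.floor))
            total += reward
        scores.append(total)
    return mean(scores)


# Disjoint instance sets: the agent only ever sees train_seeds while "learning".
train_seeds = list(range(0, 100))
test_seeds = list(range(100, 105))

# A memorizing "agent": it stores the solution of every training instance
# and falls back to a constant action on anything it has not seen.
memory = {s: ToyTower(s).solution for s in train_seeds}


def memorizing_policy(seed, floor):
    if seed in memory:
        return memory[seed][floor]
    return 0


train_score = evaluate(memorizing_policy, train_seeds)
test_score = evaluate(memorizing_policy, test_seeds)
print("train:", train_score, "held-out:", test_score)
```

Reporting only `train_score` would hide the fact that the policy has learned nothing transferable; the held-out score is the number that reflects generalization.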

Tasks

Atari Games, Board Games

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
Obstacle Tower (No Gen) fixed	RNB	Score	7	—	Unverified
Obstacle Tower (No Gen) fixed	PPO	Score	5	—	Unverified
Obstacle Tower (No Gen) varied	PPO	Score	1	—	Unverified
Obstacle Tower (No Gen) varied	RNB	Score	4.8	—	Unverified
Obstacle Tower (Strong Gen) fixed	PPO	Score	0.6	—	Unverified
Obstacle Tower (Strong Gen) fixed	RNB	Score	0.6	—	Unverified
Obstacle Tower (Strong Gen) varied	PPO	Score	0.6	—	Unverified
Obstacle Tower (Strong Gen) varied	RNB	Score	0.8	—	Unverified
Obstacle Tower (Weak Gen) fixed	RNB	Score	1	—	Unverified
Obstacle Tower (Weak Gen) fixed	PPO	Score	1.2	—	Unverified
Obstacle Tower (Weak Gen) varied	RNB	Score	3.4	—	Unverified
Obstacle Tower (Weak Gen) varied	PPO	Score	0.8	—	Unverified
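To compare the generalization conditions at a glance, the claimed scores in the table can be averaged per model and condition. The following is a minimal sketch that copies the twelve rows as listed and averages the fixed and varied rows for each pair; the variable names are illustrative, and the numbers are the unverified claimed values from the table, not independently reproduced results.

```python
from collections import defaultdict
from statistics import mean

# (generalization condition, training variant, model, claimed score),
# transcribed from the benchmark table.
rows = [
    ("No Gen", "fixed", "RNB", 7.0),
    ("No Gen", "fixed", "PPO", 5.0),
    ("No Gen", "varied", "PPO", 1.0),
    ("No Gen", "varied", "RNB", 4.8),
    ("Strong Gen", "fixed", "PPO", 0.6),
    ("Strong Gen", "fixed", "RNB", 0.6),
    ("Strong Gen", "varied", "PPO", 0.6),
    ("Strong Gen", "varied", "RNB", 0.8),
    ("Weak Gen", "fixed", "RNB", 1.0),
    ("Weak Gen", "fixed", "PPO", 1.2),
    ("Weak Gen", "varied", "RNB", 3.4),
    ("Weak Gen", "varied", "PPO", 0.8),
]

# Group scores by (model, condition), pooling the fixed and varied rows.
grouped = defaultdict(list)
for condition, _variant, model, score in rows:
    grouped[(model, condition)].append(score)

averages = {key: mean(scores) for key, scores in grouped.items()}

for (model, condition), avg in sorted(averages.items()):
    print(f"{model:3s}  {condition:10s}  {avg:.2f}")
```

For both models, the averaged claimed score is highest under No Gen and lowest under Strong Gen.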
