Evaluating the Performance of Reinforcement Learning Algorithms

2020-06-30ICML 2020Code Available1· sign in to hype

Scott M. Jordan, Yash Chandak, Daniel Cohen, Mengxue Zhang, Philip S. Thomas

Code Available — Be the first to reproduce this paper.

Code

github.com/ScottJordan/EvaluationOfRLAlgs
OfficialIn papernone★ 27

Abstract

Performance evaluations are critical for quantifying algorithmic advances in reinforcement learning. Recent reproducibility analyses have shown that reported performance results are often inconsistent and difficult to replicate. In this work, we argue that the inconsistency of performance stems from the use of flawed evaluation metrics. Taking a step towards ensuring that reported results are consistent, we propose a new comprehensive evaluation methodology for reinforcement learning algorithms that produces reliable measurements of performance both on a single environment and when aggregated across environments. We demonstrate this method by evaluating a broad class of reinforcement learning algorithms on standard benchmark tasks.

Tasks

reinforcement-learning Reinforcement Learning Reinforcement Learning (RL)

Evaluating the Performance of Reinforcement Learning Algorithms

Code

Abstract

Tasks

Reproductions