Generalized Nested Rollout Policy Adaptation

2020-03-22

Tristan Cazenave

Abstract

Nested Rollout Policy Adaptation (NRPA) is a Monte Carlo search algorithm for single-player games. In this paper we propose to generalize NRPA with a temperature and a bias, and to analyze the algorithms theoretically. The generalized algorithm is named GNRPA. Experiments show that it improves on NRPA in different application domains: SameGame and the Traveling Salesman Problem with Time Windows.
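
The abstract does not spell out how the temperature and the bias enter the playout policy. One plausible reading, sketched below as an illustration and not as the author's implementation, is a softmax over the learned move weights in which each weight is divided by a temperature and a per-move bias is added before normalization. The function names `move_probabilities` and `sample_move`, and the exact form `w / temperature + b`, are assumptions made for this example. With a temperature of 1 and all biases at 0, the sketch reduces to a plain softmax over the weights.

```python
import math
import random


def move_probabilities(weights, biases, temperature=1.0):
    """Return a probability for each legal move.

    Assumed form (illustrative): softmax over weight / temperature + bias.
    `weights` are the learned policy weights of the legal moves and
    `biases` are fixed, domain-specific priors of the same length.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    if len(weights) != len(biases):
        raise ValueError("weights and biases must have the same length")

    logits = [w / temperature + b for w, b in zip(weights, biases)]
    # Subtract the maximum logit for numerical stability; the result is unchanged.
    top = max(logits)
    exps = [math.exp(x - top) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sample_move(weights, biases, temperature=1.0, rng=random):
    """Sample the index of a move according to move_probabilities."""
    probs = move_probabilities(weights, biases, temperature)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]
```

In this sketch, a higher temperature flattens the distribution over moves, while a bias shifts probability toward moves that a domain heuristic favors even before any weights have been learned.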
