Analysis and Optimization of Deep Counterfactual Value Networks

2018-07-02Unverified0· sign in to hype

Patryk Hopner, Eneldo Loza Mencía

Unverified — Be the first to reproduce this paper.

Abstract

Recently a strong poker-playing algorithm called DeepStack was published, which is able to find an approximate Nash equilibrium during gameplay by using heuristic values of future states predicted by deep neural networks. This paper analyzes new ways of encoding the inputs and outputs of DeepStack's deep counterfactual value networks based on traditional abstraction techniques, as well as an unabstracted encoding, which was able to increase the network's accuracy.

Tasks

counterfactual

Analysis and Optimization of Deep Counterfactual Value Networks

Abstract

Tasks

Reproductions