Fitted Q-Learning for Relational Domains

2020-06-10Unverified0· sign in to hype

Srijita Das, Sriraam Natarajan, Kaushik Roy, Ronald Parr, Kristian Kersting

Unverified — Be the first to reproduce this paper.

Abstract

We consider the problem of Approximate Dynamic Programming in relational domains. Inspired by the success of fitted Q-learning methods in propositional settings, we develop the first relational fitted Q-learning algorithms by representing the value function and Bellman residuals. When we fit the Q-functions, we show how the two steps of Bellman operator; application and projection steps can be performed using a gradient-boosting technique. Our proposed framework performs reasonably well on standard domains without using domain models and using fewer training trajectories.

Tasks

Q-Learning

Fitted Q-Learning for Relational Domains

Abstract

Tasks

Reproductions