A Cantor-Kantorovich Metric Between Markov Decision Processes with Application to Transfer Learning

2024-07-11

Adrien Banse, Venkatraman Renganathan, Raphaël M. Jungers

Abstract

We extend the notion of Cantor-Kantorovich distance between Markov chains, introduced by Banse et al. (2023), to Markov Decision Processes (MDPs). The proposed metric is well-defined and can be efficiently approximated given a finite horizon. We then provide numerical evidence that this metric can lead to interesting applications in reinforcement learning. In particular, we show that it could be used to forecast the performance of transfer learning algorithms.

Tasks

Reinforcement Learning, Transfer Learning
