Synthesizing Safe Policies under Probabilistic Constraints with Reinforcement Learning and Bayesian Model Checking

2020-05-08Unverified0· sign in to hype

Lenz Belzner, Martin Wirsing

Unverified — Be the first to reproduce this paper.

Abstract

We propose to leverage epistemic uncertainty about constraint satisfaction of a reinforcement learner in safety critical domains. We introduce a framework for specification of requirements for reinforcement learners in constrained settings, including confidence about results. We show that an agent's confidence in constraint satisfaction provides a useful signal for balancing optimization and safety in the learning process.

Tasks

reinforcement-learning Reinforcement Learning (RL)

Synthesizing Safe Policies under Probabilistic Constraints with Reinforcement Learning and Bayesian Model Checking

Abstract

Tasks

Reproductions