Weighted Entropy Modification for Soft Actor-Critic

2020-11-18Unverified0· sign in to hype

Yizhou Zhao, Song-Chun Zhu

Unverified — Be the first to reproduce this paper.

Abstract

We generalize the existing principle of the maximum Shannon entropy in reinforcement learning (RL) to weighted entropy by characterizing the state-action pairs with some qualitative weights, which can be connected with prior knowledge, experience replay, and evolution process of the policy. We propose an algorithm motivated for self-balancing exploration with the introduced weight function, which leads to state-of-the-art performance on Mujoco tasks despite its simplicity in implementation.

Tasks

MuJoCo reinforcement-learning Reinforcement Learning Reinforcement Learning (RL)

Weighted Entropy Modification for Soft Actor-Critic

Abstract

Tasks

Reproductions