SOTAVerified

Distributional Reinforcement Learning

Value distribution is the distribution of the random return received by a reinforcement learning agent. it been used for a specific purpose such as implementing risk-aware behaviour.

We have random return Z whose expectation is the value Q. This random return is also described by a recursive equation, but one of a distributional nature

Papers

Showing 101110 of 137 papers

TitleStatusHype
MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning0
Automatic Risk Adaptation in Distributional Reinforcement Learning0
Non-decreasing Quantile Function Network with Efficient Exploration for Distributional Reinforcement Learning0
Bayesian Distributional Policy Gradients0
Safe Distributional Reinforcement Learning0
SENTINEL: Taming Uncertainty with Ensemble-based Distributional Reinforcement Learning0
Addressing Inherent Uncertainty: Risk-Sensitive Behavior Generation for Automated Driving using Distributional Reinforcement Learning0
Controlling Synthetic Characters in Simulations: A Case for Cognitive Architectures and Sigma0
Distributional Reinforcement Learning for Risk-Sensitive Policies0
A Distributional Perspective on Actor-Critic Framework0
Show:102550
← PrevPage 11 of 14Next →

No leaderboard results yet.