SOTAVerified

Distributional Reinforcement Learning

Value distribution is the distribution of the random return received by a reinforcement learning agent. it been used for a specific purpose such as implementing risk-aware behaviour.

We have random return Z whose expectation is the value Q. This random return is also described by a recursive equation, but one of a distributional nature

Papers

Showing 101110 of 137 papers

TitleStatusHype
A Local Temporal Difference Code for Distributional Reinforcement Learning0
An Analysis of Categorical Distributional Reinforcement Learning0
An Analysis of Quantile Temporal-Difference Learning0
An introduction to reinforcement learning for neuroscience0
A Point-Based Algorithm for Distributional Reinforcement Learning in Partially Observable Domains0
Automatic Risk Adaptation in Distributional Reinforcement Learning0
Batch-Constrained Distributional Reinforcement Learning for Session-based Recommendation0
Bayesian Distributional Policy Gradients0
Bellman Diffusion: Generative Modeling as Learning a Linear Operator in the Distribution Space0
Beyond Average Return in Markov Decision Processes0
Show:102550
← PrevPage 11 of 14Next →

No leaderboard results yet.