SOTAVerified

Distributional Reinforcement Learning

Value distribution is the distribution of the random return received by a reinforcement learning agent. it been used for a specific purpose such as implementing risk-aware behaviour.

We have random return Z whose expectation is the value Q. This random return is also described by a recursive equation, but one of a distributional nature

Papers

Showing 7180 of 137 papers

TitleStatusHype
PG-Rainbow: Using Distributional Reinforcement Learning in Policy Gradient Methods0
Pitfall of Optimism: Distributional Reinforcement Learning by Randomizing Risk Criterion0
Policy Evaluation in Distributional LQR0
Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence0
Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation0
Risk-averse policies for natural gas futures trading using distributional reinforcement learning0
Risk Perspective Exploration in Distributional Reinforcement Learning0
Robustness and risk management via distributional dynamic programming0
Robust Probabilistic Model Checking with Continuous Reward Domains0
Robust Reinforcement Learning with Distributional Risk-averse formulation0
Show:102550
← PrevPage 8 of 14Next →

No leaderboard results yet.