SOTAVerified

Distributional Reinforcement Learning

The value distribution is the distribution of the random return received by a reinforcement learning agent. It has been used for specific purposes such as implementing risk-aware behaviour.
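As a rough illustration of why the full distribution matters for risk-aware behaviour, the sketch below compares two actions whose sampled returns have the same mean but different lower tails. The sample values, the helper names `cvar` and `risk_aware_choice`, and the choice of conditional value-at-risk as the risk measure are all assumptions made for this example, not something the page specifies.

```python
def mean(returns):
    """Expected return estimated from samples."""
    return sum(returns) / len(returns)


def cvar(returns, alpha):
    """Average of the worst alpha-fraction of sampled returns (lower tail)."""
    ordered = sorted(returns)
    k = max(1, int(alpha * len(ordered)))  # number of tail samples to average
    return sum(ordered[:k]) / k


def risk_aware_choice(samples_by_action, alpha):
    """Pick the action whose lower-tail average return is highest."""
    return max(samples_by_action, key=lambda a: cvar(samples_by_action[a], alpha))


# Hypothetical return samples: identical means, very different spread.
samples = {
    "safe": [1.0, 1.0, 1.0, 1.0],
    "risky": [-2.0, 0.0, 2.0, 4.0],
}

chosen = risk_aware_choice(samples, alpha=0.25)
```

An agent that only tracks the expected return cannot tell these two actions apart, whereas one that keeps the distribution can prefer the action with the better worst case.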

We have a random return Z whose expectation is the value Q. This random return is also described by a recursive equation, but one of a distributional nature.
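For reference, the relationship between Z and Q and the recursion it satisfies are commonly written as follows. The symbols R (random reward), \gamma (discount factor), P (transition kernel), \pi (policy), and the next state-action pair (X', A') are standard notation assumed here; they do not appear on this page.

```latex
% Q is the expectation of the random return Z
Q(x, a) = \mathbb{E}\left[ Z(x, a) \right]

% Distributional Bellman equation: equality holds in distribution
Z(x, a) \overset{D}{=} R(x, a) + \gamma \, Z(X', A'),
\qquad X' \sim P(\cdot \mid x, a), \quad A' \sim \pi(\cdot \mid X')
```

Taking expectations on both sides recovers the usual Bellman equation for Q.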

Papers

Showing 101-110 of 137 papers

Title | Status | Hype
Bayesian Distributional Policy Gradients |  | 0
Safe Distributional Reinforcement Learning |  | 0
SENTINEL: Taming Uncertainty with Ensemble-based Distributional Reinforcement Learning |  | 0
Addressing Inherent Uncertainty: Risk-Sensitive Behavior Generation for Automated Driving using Distributional Reinforcement Learning |  | 0
Unifying Cardiovascular Modelling with Deep Reinforcement Learning for Uncertainty Aware Control of Sepsis Treatment | Code | 1
Controlling Synthetic Characters in Simulations: A Case for Cognitive Architectures and Sigma |  | 0
A Distributional Perspective on Actor-Critic Framework |  | 0
Distributional Reinforcement Learning for Risk-Sensitive Policies |  | 0
Batch-Constrained Distributional Reinforcement Learning for Session-based Recommendation |  | 0
A Local Temporal Difference Code for Distributional Reinforcement Learning |  | 0

No leaderboard results yet.