SOTAVerified

Distributional Reinforcement Learning

Value distribution is the distribution of the random return received by a reinforcement learning agent. it been used for a specific purpose such as implementing risk-aware behaviour.

We have random return Z whose expectation is the value Q. This random return is also described by a recursive equation, but one of a distributional nature

Papers

Showing 7180 of 137 papers

TitleStatusHype
The Nature of Temporal Difference Errors in Multi-step Distributional Reinforcement Learning0
Risk Perspective Exploration in Distributional Reinforcement Learning0
Robust Reinforcement Learning with Distributional Risk-averse formulation0
IGN : Implicit Generative NetworksCode0
A Simulation Environment and Reinforcement Learning Method for Waste Reduction0
Interpretable Stochastic Model Predictive Control using Distributional Reinforced Estimation for Quadrotor Tracking Systems0
Gamma and Vega Hedging Using Deep Distributional Reinforcement LearningCode1
Adaptive Risk-Tendency: Nano Drone Navigation in Cluttered Environments with Distributional Reinforcement LearningCode1
Distributional Reinforcement Learning for Scheduling of Chemical Production Processes0
Exploration with Multi-Sample Target Values for Distributional Reinforcement Learning0
Show:102550
← PrevPage 8 of 14Next →

No leaderboard results yet.