SOTAVerified

Distributional Reinforcement Learning

Value distribution is the distribution of the random return received by a reinforcement learning agent. it been used for a specific purpose such as implementing risk-aware behaviour.

We have random return Z whose expectation is the value Q. This random return is also described by a recursive equation, but one of a distributional nature

Papers

Showing 131137 of 137 papers

TitleStatusHype
EX-DRL: Hedging Against Heavy Losses with EXtreme Distributional Reinforcement LearningCode0
Information-Directed Exploration for Deep Reinforcement LearningCode0
A Robust Quantile Huber Loss With Interpretable Parameter Adjustment In Distributional Reinforcement LearningCode0
Exploring the Training Robustness of Distributional Reinforcement Learning against Noisy State ObservationsCode0
RIZE: Regularized Imitation Learning via Distributional Reinforcement LearningCode0
Two steps to risk sensitivityCode0
Variance Control for Distributional Reinforcement LearningCode0
Show:102550
← PrevPage 14 of 14Next →

No leaderboard results yet.