SOTAVerified

Distributional Reinforcement Learning

Value distribution is the distribution of the random return received by a reinforcement learning agent. it been used for a specific purpose such as implementing risk-aware behaviour.

We have random return Z whose expectation is the value Q. This random return is also described by a recursive equation, but one of a distributional nature

Papers

Showing 126137 of 137 papers

TitleStatusHype
Statistics and Samples in Distributional Reinforcement Learning0
Distributional reinforcement learning with linear function approximation0
A Comparative Analysis of Expected and Distributional Reinforcement Learning0
Information-Directed Exploration for Deep Reinforcement LearningCode0
QUOTA: The Quantile Option Architecture for Reinforcement LearningCode0
Implicit Quantile Networks for Distributional Reinforcement LearningCode0
Nonlinear Distributional Gradient Temporal-Difference Learning0
GAN Q-learningCode0
Exploration by Distributional Reinforcement Learning0
An Analysis of Categorical Distributional Reinforcement Learning0
Distributional Reinforcement Learning with Quantile RegressionCode0
The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning0
Show:102550
← PrevPage 6 of 6Next →

No leaderboard results yet.