SOTAVerified

Distributional Reinforcement Learning

Value distribution is the distribution of the random return received by a reinforcement learning agent. it been used for a specific purpose such as implementing risk-aware behaviour.

We have random return Z whose expectation is the value Q. This random return is also described by a recursive equation, but one of a distributional nature

Papers

Showing 111120 of 137 papers

TitleStatusHype
Distributional Reinforcement Learning for Energy-Based Sequential ModelsCode0
Tackling Uncertainties in Multi-Agent Reinforcement Learning through Integration of Agent Termination DynamicsCode0
GAN Q-learningCode0
Distributional Reinforcement Learning for Multi-Dimensional Reward FunctionsCode0
The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement LearningCode0
QUOTA: The Quantile Option Architecture for Reinforcement LearningCode0
Distributional Reinforcement Learning with Regularized Wasserstein LossCode0
Value-Distributional Model-Based Reinforcement LearningCode0
Distributional Reinforcement Learning with Quantile RegressionCode0
IGN : Implicit Generative NetworksCode0
Show:102550
← PrevPage 12 of 14Next →

No leaderboard results yet.