SOTAVerified

Distributional Reinforcement Learning

Value distribution is the distribution of the random return received by a reinforcement learning agent. it been used for a specific purpose such as implementing risk-aware behaviour.

We have random return Z whose expectation is the value Q. This random return is also described by a recursive equation, but one of a distributional nature

Papers

Showing 1120 of 137 papers

TitleStatusHype
Adaptive Risk-Tendency: Nano Drone Navigation in Cluttered Environments with Distributional Reinforcement LearningCode1
ADDQ: Adaptive Distributional Double Q-LearningCode0
A Cramér Distance perspective on Quantile Regression based Distributional Reinforcement LearningCode0
Distributional Reinforcement Learning with Quantile RegressionCode0
Distributional Reinforcement Learning with Regularized Wasserstein LossCode0
Distributional Off-policy Evaluation with Bellman Residual MinimizationCode0
Distributional constrained reinforcement learning for supply chain optimizationCode0
Distributional Model Equivalence for Risk-Sensitive Reinforcement LearningCode0
Distributional Reinforcement Learning for Energy-Based Sequential ModelsCode0
CTD4 -- A Deep Continuous Distributional Actor-Critic Agent with a Kalman Fusion of Multiple CriticsCode0
Show:102550
← PrevPage 2 of 14Next →

No leaderboard results yet.