SOTAVerified

Distributional Reinforcement Learning

Value distribution is the distribution of the random return received by a reinforcement learning agent. it been used for a specific purpose such as implementing risk-aware behaviour.

We have random return Z whose expectation is the value Q. This random return is also described by a recursive equation, but one of a distributional nature

Papers

Showing 121130 of 137 papers

TitleStatusHype
Distributional constrained reinforcement learning for supply chain optimizationCode0
Distributional Bellman Operators over Mean EmbeddingsCode0
Beyond CVaR: Leveraging Static Spectral Risk Measures for Enhanced Decision-Making in Distributional Reinforcement LearningCode0
Implicit Quantile Networks for Distributional Reinforcement LearningCode0
Constrained Reinforcement Learning using Distributional Representation for Trustworthy Quadrotor UAV Tracking ControlCode0
ADDQ: Adaptive Distributional Double Q-LearningCode0
Echoes of Socratic Doubt: Embracing Uncertainty in Calibrated Evidential Reinforcement LearningCode0
CTD4 -- A Deep Continuous Distributional Actor-Critic Agent with a Kalman Fusion of Multiple CriticsCode0
Estimating Risk and Uncertainty in Deep Reinforcement LearningCode0
Estimation and Inference in Distributional Reinforcement LearningCode0
Show:102550
← PrevPage 13 of 14Next →

No leaderboard results yet.