SOTAVerified

Distributional Reinforcement Learning

Value distribution is the distribution of the random return received by a reinforcement learning agent. it been used for a specific purpose such as implementing risk-aware behaviour.

We have random return Z whose expectation is the value Q. This random return is also described by a recursive equation, but one of a distributional nature

Papers

Showing 5160 of 137 papers

TitleStatusHype
A Comparative Analysis of Expected and Distributional Reinforcement Learning0
Demand-Side Scheduling Based on Multi-Agent Deep Actor-Critic Learning for Smart Grids0
Distributional Reinforcement Learning with Dual Expectile-Quantile Regression0
Distributional Reinforcement Learning with Online Risk-awareness Adaption0
Deep Reinforcement Learning for Artificial Upwelling Energy Management0
Cramer Type Distances for Learning Gaussian Mixture Models by Gradient Descent0
Addressing Inherent Uncertainty: Risk-Sensitive Behavior Generation for Automated Driving using Distributional Reinforcement Learning0
Exploration by Distributional Reinforcement Learning0
CTRLS: Chain-of-Thought Reasoning via Latent State-Transition0
A Finite Sample Analysis of Distributional TD Learning with Linear Function Approximation0
Show:102550
← PrevPage 6 of 14Next →

No leaderboard results yet.