SOTAVerified

Distributional Reinforcement Learning

Value distribution is the distribution of the random return received by a reinforcement learning agent. it been used for a specific purpose such as implementing risk-aware behaviour.

We have random return Z whose expectation is the value Q. This random return is also described by a recursive equation, but one of a distributional nature

Papers

Showing 8190 of 137 papers

TitleStatusHype
Exploration by Distributional Reinforcement Learning0
Exploration with Multi-Sample Target Values for Distributional Reinforcement Learning0
Exploring the Robustness of Distributional Reinforcement Learning against Noisy State Observations0
A Finite Sample Analysis of Distributional TD Learning with Linear Function Approximation0
Flow Models for Unbounded and Geometry-Aware Distributional Reinforcement Learning0
Foundations of Multivariate Distributional Reinforcement Learning0
GAN-powered Deep Distributional Reinforcement Learning for Resource Management in Network Slicing0
A Simulation Environment and Reinforcement Learning Method for Waste Reduction0
Hedging and Pricing Structured Products Featuring Multiple Underlying Assets0
How Does Return Distribution in Distributional Reinforcement Learning Help Optimization?0
Show:102550
← PrevPage 9 of 14Next →

No leaderboard results yet.