SOTAVerified

Distributional Reinforcement Learning

Value distribution is the distribution of the random return received by a reinforcement learning agent. it been used for a specific purpose such as implementing risk-aware behaviour.

We have random return Z whose expectation is the value Q. This random return is also described by a recursive equation, but one of a distributional nature

Papers

Showing 91100 of 137 papers

TitleStatusHype
Towards Understanding Distributional Reinforcement Learning: Regularization, Optimization, Acceleration and Sinkhorn Algorithm0
Exploring the Robustness of Distributional Reinforcement Learning against Noisy State Observations0
Distributional Reinforcement Learning with Monotonic Splines0
Exploring the Training Robustness of Distributional Reinforcement Learning against Noisy State ObservationsCode0
Minimizing Safety Interference for Safe and Comfortable Automated Driving with Distributional Reinforcement Learning0
Conservative Offline Distributional Reinforcement LearningCode1
MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning0
Automatic Risk Adaptation in Distributional Reinforcement Learning0
Distributional Reinforcement Learning with Unconstrained Monotonic Neural NetworksCode1
Non-decreasing Quantile Function Network with Efficient Exploration for Distributional Reinforcement Learning0
Show:102550
← PrevPage 10 of 14Next →

No leaderboard results yet.