SOTAVerified

Distributional Reinforcement Learning

Value distribution is the distribution of the random return received by a reinforcement learning agent. it been used for a specific purpose such as implementing risk-aware behaviour.

We have random return Z whose expectation is the value Q. This random return is also described by a recursive equation, but one of a distributional nature

Papers

Showing 91100 of 137 papers

TitleStatusHype
Improving Robustness via Risk Averse Distributional Reinforcement Learning0
Improving the generalizability and robustness of large-scale traffic signal control0
Interpretable Stochastic Model Predictive Control using Distributional Reinforced Estimation for Quadrotor Tracking Systems0
Invariance to Quantile Selection in Distributional Continuous Control0
Is Risk-Sensitive Reinforcement Learning Properly Resolved?0
Learning Risk-Aware Quadrupedal Locomotion using Distributional Reinforcement Learning0
Millimeter Wave Communications with an Intelligent Reflector: Performance Optimization and Distributional Reinforcement Learning0
Minimizing Safety Interference for Safe and Comfortable Automated Driving with Distributional Reinforcement Learning0
MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning0
More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning0
Show:102550
← PrevPage 10 of 14Next →

No leaderboard results yet.