SOTAVerified

Distributional Reinforcement Learning

Value distribution is the distribution of the random return received by a reinforcement learning agent. it been used for a specific purpose such as implementing risk-aware behaviour.

We have random return Z whose expectation is the value Q. This random return is also described by a recursive equation, but one of a distributional nature

Papers

Showing 111120 of 137 papers

TitleStatusHype
Non-Crossing Quantile Regression for Distributional Reinforcement Learning0
Distributional Reinforcement Learning for mmWave Communications with Intelligent Reflectors on a UAV0
Distributional Reinforcement Learning via Moment MatchingCode1
Implicit Distributional Reinforcement LearningCode1
Demand-Side Scheduling Based on Multi-Agent Deep Actor-Critic Learning for Smart Grids0
Improving Robustness via Risk Averse Distributional Reinforcement Learning0
Distributional Reinforcement Learning with Ensembles0
Millimeter Wave Communications with an Intelligent Reflector: Performance Optimization and Distributional Reinforcement Learning0
Sample-based Distributional Policy Gradient0
Distributional Reinforcement Learning for Energy-Based Sequential ModelsCode0
Show:102550
← PrevPage 12 of 14Next →

No leaderboard results yet.