SOTAVerified

Distributional Reinforcement Learning

Value distribution is the distribution of the random return received by a reinforcement learning agent. it been used for a specific purpose such as implementing risk-aware behaviour.

We have random return Z whose expectation is the value Q. This random return is also described by a recursive equation, but one of a distributional nature

Papers

Showing 110 of 137 papers

TitleStatusHype
A Distributional Analogue to the Successor RepresentationCode1
Trust Region-Based Safe Distributional Reinforcement Learning for Multiple ConstraintsCode1
Risk-Sensitive Policy with Distributional Reinforcement LearningCode1
Intelligent Resource Allocation in Joint Radar-Communication With Graph Neural NetworksCode1
Gamma and Vega Hedging Using Deep Distributional Reinforcement LearningCode1
Adaptive Risk-Tendency: Nano Drone Navigation in Cluttered Environments with Distributional Reinforcement LearningCode1
Conservative Offline Distributional Reinforcement LearningCode1
Distributional Reinforcement Learning with Unconstrained Monotonic Neural NetworksCode1
Unifying Cardiovascular Modelling with Deep Reinforcement Learning for Uncertainty Aware Control of Sepsis TreatmentCode1
Distributional Reinforcement Learning via Moment MatchingCode1
Show:102550
← PrevPage 1 of 14Next →

No leaderboard results yet.