SOTAVerified

Distributional Reinforcement Learning

Value distribution is the distribution of the random return received by a reinforcement learning agent. it been used for a specific purpose such as implementing risk-aware behaviour.

We have random return Z whose expectation is the value Q. This random return is also described by a recursive equation, but one of a distributional nature

Papers

Showing 2130 of 137 papers

TitleStatusHype
Adaptive Nesterov Accelerated Distributional Deep Hedging for Efficient Volatility Risk Management0
A Finite Sample Analysis of Distributional TD Learning with Linear Function Approximation0
Robust Probabilistic Model Checking with Continuous Reward Domains0
Tackling Uncertainties in Multi-Agent Reinforcement Learning through Integration of Agent Termination DynamicsCode0
Risk-averse policies for natural gas futures trading using distributional reinforcement learning0
Beyond CVaR: Leveraging Static Spectral Risk Measures for Enhanced Decision-Making in Distributional Reinforcement LearningCode0
Hedging and Pricing Structured Products Featuring Multiple Underlying Assets0
Action Gaps and Advantages in Continuous-Time Distributional Reinforcement Learning0
Bellman Diffusion: Generative Modeling as Learning a Linear Operator in the Distribution Space0
Offline and Distributional Reinforcement Learning for Radio Resource Management0
Show:102550
← PrevPage 3 of 14Next →

No leaderboard results yet.