SOTAVerified

Distributional Reinforcement Learning

Value distribution is the distribution of the random return received by a reinforcement learning agent. it been used for a specific purpose such as implementing risk-aware behaviour.

We have random return Z whose expectation is the value Q. This random return is also described by a recursive equation, but one of a distributional nature

Papers

Showing 150 of 137 papers

TitleStatusHype
Second-Order Bounds for [0,1]-Valued Regression via Betting Loss0
Distributional Reinforcement Learning on Path-dependent Options0
CTRLS: Chain-of-Thought Reasoning via Latent State-Transition0
ADDQ: Adaptive Distributional Double Q-LearningCode0
A Point-Based Algorithm for Distributional Reinforcement Learning in Partially Observable Domains0
Flow Models for Unbounded and Geometry-Aware Distributional Reinforcement Learning0
Deep Distributional Learning with Non-crossing Quantile Network0
Offline and Distributional Reinforcement Learning for Wireless Communications0
RIZE: Regularized Imitation Learning via Distributional Reinforcement LearningCode0
Adaptive Nesterov Accelerated Distributional Deep Hedging for Efficient Volatility Risk Management0
A Finite Sample Analysis of Distributional TD Learning with Linear Function Approximation0
Robust Probabilistic Model Checking with Continuous Reward Domains0
Tackling Uncertainties in Multi-Agent Reinforcement Learning through Integration of Agent Termination DynamicsCode0
Risk-averse policies for natural gas futures trading using distributional reinforcement learning0
Beyond CVaR: Leveraging Static Spectral Risk Measures for Enhanced Decision-Making in Distributional Reinforcement LearningCode0
Hedging and Pricing Structured Products Featuring Multiple Underlying Assets0
Action Gaps and Advantages in Continuous-Time Distributional Reinforcement Learning0
Bellman Diffusion: Generative Modeling as Learning a Linear Operator in the Distribution Space0
Offline and Distributional Reinforcement Learning for Radio Resource Management0
Foundations of Multivariate Distributional Reinforcement Learning0
EX-DRL: Hedging Against Heavy Losses with EXtreme Distributional Reinforcement LearningCode0
Bellman Unbiasedness: Toward Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation0
On Policy Evaluation Algorithms in Distributional Reinforcement Learning0
PG-Rainbow: Using Distributional Reinforcement Learning in Policy Gradient Methods0
Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence0
CTD4 -- A Deep Continuous Distributional Actor-Critic Agent with a Kalman Fusion of Multiple CriticsCode0
Statistical Efficiency of Distributional Temporal Difference Learning and Freedman's Inequality in Hilbert Spaces0
Provable Risk-Sensitive Distributional Reinforcement Learning with General Function Approximation0
Uncertainty-Aware Transient Stability-Constrained Preventive Redispatch: A Distributional Reinforcement Learning Approach0
A Distributional Analogue to the Successor RepresentationCode1
Near-Minimax-Optimal Distributional Reinforcement Learning with a Generative Model0
More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning0
Echoes of Socratic Doubt: Embracing Uncertainty in Calibrated Evidential Reinforcement LearningCode0
Distributional Off-policy Evaluation with Bellman Residual MinimizationCode0
A Robust Quantile Huber Loss With Interpretable Parameter Adjustment In Distributional Reinforcement LearningCode0
Distributional Reinforcement Learning-based Energy Arbitrage Strategies in Imbalance Settlement Mechanism0
Noise Distribution Decomposition based Multi-Agent Distributional Reinforcement Learning0
Distributional Bellman Operators over Mean EmbeddingsCode0
An introduction to reinforcement learning for neuroscience0
Beyond Average Return in Markov Decision Processes0
Pitfall of Optimism: Distributional Reinforcement Learning by Randomizing Risk Criterion0
Distributional Reinforcement Learning with Online Risk-awareness Adaption0
Estimation and Inference in Distributional Reinforcement LearningCode0
Learning Risk-Aware Quadrupedal Locomotion using Distributional Reinforcement Learning0
Deep Reinforcement Learning for Artificial Upwelling Energy Management0
Value-Distributional Model-Based Reinforcement LearningCode0
Variance Control for Distributional Reinforcement LearningCode0
Cramer Type Distances for Learning Gaussian Mixture Models by Gradient Descent0
Distributional Model Equivalence for Risk-Sensitive Reinforcement LearningCode0
Is Risk-Sensitive Reinforcement Learning Properly Resolved?0
Show:102550
← PrevPage 1 of 3Next →

No leaderboard results yet.