SOTAVerified

Distributional Reinforcement Learning

Value distribution is the distribution of the random return received by a reinforcement learning agent. it been used for a specific purpose such as implementing risk-aware behaviour.

We have random return Z whose expectation is the value Q. This random return is also described by a recursive equation, but one of a distributional nature

Papers

Showing 101137 of 137 papers

TitleStatusHype
A Local Temporal Difference Code for Distributional Reinforcement Learning0
An Analysis of Categorical Distributional Reinforcement Learning0
An Analysis of Quantile Temporal-Difference Learning0
An introduction to reinforcement learning for neuroscience0
A Point-Based Algorithm for Distributional Reinforcement Learning in Partially Observable Domains0
Automatic Risk Adaptation in Distributional Reinforcement Learning0
Batch-Constrained Distributional Reinforcement Learning for Session-based Recommendation0
Bayesian Distributional Policy Gradients0
Bellman Diffusion: Generative Modeling as Learning a Linear Operator in the Distribution Space0
Beyond Average Return in Markov Decision Processes0
Bridging Distributional and Risk-sensitive Reinforcement Learning with Provable Regret Bounds0
Conservative Distributional Reinforcement Learning with Safety Constraints0
Controlling Synthetic Characters in Simulations: A Case for Cognitive Architectures and Sigma0
Cramer Type Distances for Learning Gaussian Mixture Models by Gradient Descent0
CTRLS: Chain-of-Thought Reasoning via Latent State-Transition0
Deep Distributional Learning with Non-crossing Quantile Network0
Deep Reinforcement Learning for Artificial Upwelling Energy Management0
Demand-Side Scheduling Based on Multi-Agent Deep Actor-Critic Learning for Smart Grids0
Distributional Perturbation for Efficient Exploration in Distributional Reinforcement Learning0
Distributional Reinforcement Learning-based Energy Arbitrage Strategies in Imbalance Settlement Mechanism0
Distributional Reinforcement Learning for Efficient Exploration0
Distributional Reinforcement Learning for Risk-Sensitive Policies0
Distributional Reinforcement Learning for mmWave Communications with Intelligent Reflectors on a UAV0
Distributional Reinforcement Learning for Scheduling of Chemical Production Processes0
Distributional Reinforcement Learning on Path-dependent Options0
Distributional reinforcement learning with linear function approximation0
Distributional Reinforcement Learning with Ensembles0
Distributional Reinforcement Learning with Monotonic Splines0
Distributional Reinforcement Learning with Dual Expectile-Quantile Regression0
Distributional Reinforcement Learning with Online Risk-awareness Adaption0
Diverse Projection Ensembles for Distributional Reinforcement Learning0
Exploration by Distributional Reinforcement Learning0
Exploration with Multi-Sample Target Values for Distributional Reinforcement Learning0
Exploring the Robustness of Distributional Reinforcement Learning against Noisy State Observations0
A Finite Sample Analysis of Distributional TD Learning with Linear Function Approximation0
Flow Models for Unbounded and Geometry-Aware Distributional Reinforcement Learning0
Foundations of Multivariate Distributional Reinforcement Learning0
Show:102550
← PrevPage 3 of 3Next →

No leaderboard results yet.