SOTAVerified

Distributional Reinforcement Learning

The value distribution is the distribution of the random return received by a reinforcement learning agent. It has been used for specific purposes such as implementing risk-aware behaviour.
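To make the risk-aware use case concrete, here is a minimal sketch that compares two sets of sampled returns with the same mean but different spread. The two actions, their Gaussian return models, and the `cvar` helper are hypothetical and chosen only for illustration. Conditional value at risk (CVaR) is one common risk measure, not something this page specifies.

```python
import random
import statistics

def cvar(samples, alpha):
    """Average of the worst alpha-fraction of sampled returns (lower tail)."""
    ordered = sorted(samples)
    k = max(1, int(alpha * len(ordered)))  # number of tail samples to average
    return statistics.fmean(ordered[:k])

rng = random.Random(0)
# Hypothetical return samples for two actions: same expected return, different spread.
safe = [rng.gauss(1.0, 0.1) for _ in range(10_000)]
risky = [rng.gauss(1.0, 2.0) for _ in range(10_000)]

mean_safe, mean_risky = statistics.fmean(safe), statistics.fmean(risky)
cvar_safe, cvar_risky = cvar(safe, 0.1), cvar(risky, 0.1)

print(f"mean: safe={mean_safe:.2f} risky={mean_risky:.2f}")
print(f"CVaR(10%): safe={cvar_safe:.2f} risky={cvar_risky:.2f}")
```

An agent that only sees the expected value cannot tell these two actions apart, while one with access to the return distribution can prefer the action with the better lower tail.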

We have a random return Z whose expectation is the value Q. This random return is also described by a recursive equation, but one of a distributional nature.
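For reference, this recursion is usually written as the distributional Bellman equation. The notation below (policy π, reward R, discount factor γ, transition kernel P, next state-action pair (X', A')) is assumed here from common usage and does not appear on this page. The symbol over the equals sign denotes equality in distribution.

```latex
Q^{\pi}(x, a) = \mathbb{E}\left[ Z^{\pi}(x, a) \right],
\qquad
Z^{\pi}(x, a) \overset{D}{=} R(x, a) + \gamma \, Z^{\pi}(X', A'),
\quad X' \sim P(\cdot \mid x, a), \; A' \sim \pi(\cdot \mid X')
```

Taking expectations on both sides of the second relation recovers the familiar Bellman equation for Q.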

Papers

Showing 51-100 of 137 papers

Title | Status | Hype
Robustness and risk management via distributional dynamic programming |  | 0
Robust Probabilistic Model Checking with Continuous Reward Domains |  | 0
Robust Reinforcement Learning with Distributional Risk-averse formulation |  | 0
Safe Distributional Reinforcement Learning |  | 0
Sample-based Distributional Policy Gradient |  | 0
Second-Order Bounds for [0,1]-Valued Regression via Betting Loss |  | 0
SENTINEL: Taming Uncertainty with Ensemble-based Distributional Reinforcement Learning |  | 0
Statistical Efficiency of Distributional Temporal Difference Learning and Freedman's Inequality in Hilbert Spaces |  | 0
Statistics and Samples in Distributional Reinforcement Learning |  | 0
Stochastically Dominant Distributional Reinforcement Learning |  | 0
The Nature of Temporal Difference Errors in Multi-step Distributional Reinforcement Learning |  | 0
The Reactor: A fast and sample-efficient Actor-Critic agent for Reinforcement Learning |  | 0
The Statistical Benefits of Quantile Temporal-Difference Learning for Value Estimation |  | 0
Toward Risk-based Optimistic Exploration for Cooperative Multi-Agent Reinforcement Learning |  | 0
The Benefits of Being Categorical Distributional: Uncertainty-aware Regularized Exploration in Reinforcement Learning |  | 0
Towards Understanding Distributional Reinforcement Learning: Regularization, Optimization, Acceleration and Sinkhorn Algorithm |  | 0
Uncertainty-Aware Transient Stability-Constrained Preventive Redispatch: A Distributional Reinforcement Learning Approach |  | 0
Distributional Perturbation for Efficient Exploration in Distributional Reinforcement Learning |  | 0
Distributional Reinforcement Learning-based Energy Arbitrage Strategies in Imbalance Settlement Mechanism |  | 0
Distributional Reinforcement Learning for Efficient Exploration |  | 0
Distributional Reinforcement Learning for Risk-Sensitive Policies |  | 0
Distributional Reinforcement Learning for mmWave Communications with Intelligent Reflectors on a UAV |  | 0
Distributional Reinforcement Learning for Scheduling of Chemical Production Processes |  | 0
Distributional Reinforcement Learning on Path-dependent Options |  | 0
Distributional reinforcement learning with linear function approximation |  | 0
Distributional Reinforcement Learning with Ensembles |  | 0
Distributional Reinforcement Learning with Monotonic Splines |  | 0
Distributional Reinforcement Learning with Dual Expectile-Quantile Regression |  | 0
Distributional Reinforcement Learning with Online Risk-awareness Adaption |  | 0
Diverse Projection Ensembles for Distributional Reinforcement Learning |  | 0
Exploration by Distributional Reinforcement Learning |  | 0
Exploration with Multi-Sample Target Values for Distributional Reinforcement Learning |  | 0
Exploring the Robustness of Distributional Reinforcement Learning against Noisy State Observations |  | 0
A Finite Sample Analysis of Distributional TD Learning with Linear Function Approximation |  | 0
Flow Models for Unbounded and Geometry-Aware Distributional Reinforcement Learning |  | 0
Foundations of Multivariate Distributional Reinforcement Learning |  | 0
GAN-powered Deep Distributional Reinforcement Learning for Resource Management in Network Slicing |  | 0
A Simulation Environment and Reinforcement Learning Method for Waste Reduction |  | 0
Hedging and Pricing Structured Products Featuring Multiple Underlying Assets |  | 0
How Does Return Distribution in Distributional Reinforcement Learning Help Optimization? |  | 0
Improving Robustness via Risk Averse Distributional Reinforcement Learning |  | 0
Improving the generalizability and robustness of large-scale traffic signal control |  | 0
Interpretable Stochastic Model Predictive Control using Distributional Reinforced Estimation for Quadrotor Tracking Systems |  | 0
Invariance to Quantile Selection in Distributional Continuous Control |  | 0
Is Risk-Sensitive Reinforcement Learning Properly Resolved? |  | 0
Learning Risk-Aware Quadrupedal Locomotion using Distributional Reinforcement Learning |  | 0
Millimeter Wave Communications with an Intelligent Reflector: Performance Optimization and Distributional Reinforcement Learning |  | 0
Minimizing Safety Interference for Safe and Comfortable Automated Driving with Distributional Reinforcement Learning |  | 0
MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning |  | 0
More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning |  | 0

No leaderboard results yet.