SOTAVerified

Distributional Reinforcement Learning

Value distribution is the distribution of the random return received by a reinforcement learning agent. it been used for a specific purpose such as implementing risk-aware behaviour.

We have random return Z whose expectation is the value Q. This random return is also described by a recursive equation, but one of a distributional nature

Papers

Showing 101125 of 137 papers

TitleStatusHype
Bayesian Distributional Policy Gradients0
Safe Distributional Reinforcement Learning0
SENTINEL: Taming Uncertainty with Ensemble-based Distributional Reinforcement Learning0
Addressing Inherent Uncertainty: Risk-Sensitive Behavior Generation for Automated Driving using Distributional Reinforcement Learning0
Unifying Cardiovascular Modelling with Deep Reinforcement Learning for Uncertainty Aware Control of Sepsis TreatmentCode1
Controlling Synthetic Characters in Simulations: A Case for Cognitive Architectures and Sigma0
A Distributional Perspective on Actor-Critic Framework0
Distributional Reinforcement Learning for Risk-Sensitive Policies0
Batch-Constrained Distributional Reinforcement Learning for Session-based Recommendation0
A Local Temporal Difference Code for Distributional Reinforcement Learning0
Non-Crossing Quantile Regression for Distributional Reinforcement Learning0
Distributional Reinforcement Learning for mmWave Communications with Intelligent Reflectors on a UAV0
Distributional Reinforcement Learning via Moment MatchingCode1
Implicit Distributional Reinforcement LearningCode1
Demand-Side Scheduling Based on Multi-Agent Deep Actor-Critic Learning for Smart Grids0
Improving Robustness via Risk Averse Distributional Reinforcement Learning0
Distributional Reinforcement Learning with Ensembles0
Millimeter Wave Communications with an Intelligent Reflector: Performance Optimization and Distributional Reinforcement Learning0
Sample-based Distributional Policy Gradient0
Distributional Reinforcement Learning for Energy-Based Sequential ModelsCode0
Fully Parameterized Quantile Function for Distributional Reinforcement LearningCode0
Estimating Risk and Uncertainty in Deep Reinforcement LearningCode0
Stochastically Dominant Distributional Reinforcement Learning0
Distributional Reinforcement Learning for Efficient Exploration0
GAN-powered Deep Distributional Reinforcement Learning for Resource Management in Network Slicing0
Show:102550
← PrevPage 5 of 6Next →

No leaderboard results yet.