SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 65516575 of 15113 papers

TitleStatusHype
Distributional Decision Transformer for Hindsight Information Matching0
Distributional Hamilton-Jacobi-Bellman Equations for Continuous-Time Reinforcement Learning0
Distributionally Adaptive Meta Reinforcement Learning0
Distributionally-Constrained Policy Optimization via Unbalanced Optimal Transport0
Distributionally Robust Imitation Learning0
Distributionally Robust Model-Based Offline Reinforcement Learning with Near-Optimal Sample Complexity0
Distributionally Robust Offline Reinforcement Learning with Linear Function Approximation0
Distributionally Robust Reinforcement Learning0
Distributionally Robust Reinforcement Learning with Interactive Data Collection: Fundamental Hardness and Near-Optimal Algorithm0
Distributional Method for Risk Averse Reinforcement Learning0
Distributional Perturbation for Efficient Exploration in Distributional Reinforcement Learning0
Distributional Reinforcement Learning for Efficient Exploration0
Distributional Reinforcement Learning for Risk-Sensitive Policies0
Distributional Reinforcement Learning for mmWave Communications with Intelligent Reflectors on a UAV0
Distributional Reinforcement Learning for Scheduling of Chemical Production Processes0
Distributional reinforcement learning with linear function approximation0
Distributional Reinforcement Learning with Ensembles0
Distributional Reinforcement Learning with Monotonic Splines0
Distributional Reinforcement Learning with Dual Expectile-Quantile Regression0
Distributional Reinforcement Learning with Online Risk-awareness Adaption0
Distributional Reward Decomposition for Reinforcement Learning0
Distributional Robustness and Regularization in Reinforcement Learning0
DSAC: Distributional Soft Actor Critic for Risk-Sensitive Reinforcement Learning0
Distributional Soft Actor-Critic with Harmonic Gradient for Safe and Efficient Autonomous Driving in Multi-lane Scenarios0
Distributive Dynamic Spectrum Access through Deep Reinforcement Learning: A Reservoir Computing Based Approach0
Show:102550
← PrevPage 263 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified