SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 71517175 of 15113 papers

TitleStatusHype
Explainable Deep Reinforcement Learning: State of the Art and Challenges0
Explainable Deep Reinforcement Learning Using Introspection in a Non-episodic Task0
Explainable Reinforcement Learning: A Survey0
Explainable Reinforcement Learning for Broad-XAI: A Conceptual Framework and Survey0
Explainable Reinforcement Learning on Financial Stock Trading using SHAP0
Explainable Reinforcement Learning Through Goal-Based Explanations0
Explainable Reinforcement Learning via Temporal Policy Decomposition0
Explainable robotic systems: Understanding goal-driven actions in a reinforcement learning scenario0
Explaining a Deep Reinforcement Learning Docking Agent Using Linear Model Trees with User Adapted Visualization0
Explaining Agent's Decision-making in a Hierarchical Reinforcement Learning Scenario0
Explaining Conditions for Reinforcement Learning Behaviors from Real and Imagined Data0
Explaining Deep Reinforcement Learning Agents In The Atari Domain through a Surrogate Model0
Explaining Online Reinforcement Learning Decisions of Self-Adaptive Systems0
Explaining Reinforcement Learning to Mere Mortals: An Empirical Study0
Explanation Augmented Feedback in Human-in-the-Loop Reinforcement Learning0
Explanation of Reinforcement Learning Model in Dynamic Multi-Agent System0
Explicit Explore, Exploit, or Escape (E^4): near-optimal safety-constrained reinforcement learning in polynomial time0
Explicit Lipschitz Value Estimation Enhances Policy Robustness Against Perturbation0
Explicit Mean-Square Error Bounds for Monte-Carlo and Linear Stochastic Approximation0
Explicit Pareto Front Optimization for Constrained Reinforcement Learning0
Explicit Planning for Efficient Exploration in Reinforcement Learning0
Explicit Recall for Efficient Exploration0
Explicit User Manipulation in Reinforcement Learning Based Recommender Systems0
Exploiting Action Impact Regularity and Exogenous State Variables for Offline Reinforcement Learning0
Exploiting Contextual Structure to Generate Useful Auxiliary Tasks0
Show:102550
← PrevPage 287 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified