SOTAVerified

Reinforcement Learning (RL)

Reinforcement Learning (RL) involves training an agent to take actions in an environment to maximize a cumulative reward signal. The agent interacts with the environment and learns by receiving feedback in the form of rewards or punishments for its actions. The goal of reinforcement learning is to find the optimal policy or decision-making strategy that maximizes the long-term reward.

Papers

Showing 1087610900 of 15113 papers

TitleStatusHype
Toward negotiable reinforcement learning: shifting priorities in Pareto optimal sequential decision-making0
Toward Pareto Efficient Fairness-Utility Trade-off inRecommendation through Reinforcement Learning0
Toward Real-Time Decentralized Reinforcement Learning using Finite Support Basis Functions0
Toward Reliable Designs of Data-Driven Reinforcement Learning Tracking Control for Euler-Lagrange Systems0
Toward Risk-based Optimistic Exploration for Cooperative Multi-Agent Reinforcement Learning0
Towards a Better Understanding of Representation Dynamics under TD-learning0
Towards a Deep Reinforcement Learning Approach for Tower Line Wars0
Towards a Formal Theory of the Need for Competence via Computational Intrinsic Motivation0
Towards a Fully Autonomous UAV Controller for Moving Platform Detection and Landing0
Towards a General Framework for ML-based Self-tuning Databases0
Towards AI-controlled FES-restoration of arm movements: Controlling for progressive muscular fatigue with Gaussian state-space models0
Towards AI-controlled FES-restoration of arm movements: neuromechanics-based reinforcement learning for 3-D reaching0
Towards a Metric for Automated Conversational Dialogue System Evaluation and Improvement0
Towards an Adaptable and Generalizable Optimization Engine in Decision and Control: A Meta Reinforcement Learning Approach0
Towards an Adaptive Robot for Sports and Rehabilitation Coaching0
Towards an Interpretable Hierarchical Agent Framework using Semantic Goals0
Towards Applicable Reinforcement Learning: Improving the Generalization and Sample Efficiency with Policy Ensemble0
Towards a practical measure of interference for reinforcement learning0
A step toward a reinforcement learning de novo genome assembler0
Towards a Sample Efficient Reinforcement Learning Pipeline for Vision Based Robotics0
Towards a Simple Approach to Multi-step Model-based Reinforcement Learning0
Towards a Solution to Bongard Problems: A Causal Approach0
Towards a Sustainable Internet-of-Underwater-Things based on AUVs, SWIPT, and Reinforcement Learning0
Towards a Theoretical Foundation of Policy Optimization for Learning Control Policies0
Towards A Unified Agent with Foundation Models0
Show:102550
← PrevPage 436 of 605Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PPGMean Normalized Performance0.76Unverified
2PPOMean Normalized Performance0.58Unverified